AN OFFICIAL PUBLICATION OF THE MATHEMATICAL ASSOCIATION OF AMERICA

THE AMERICAN MATHEMATICAL MONTHLY

Volume 106, Number 5
May 1999

George K. Francis and Jeffrey R. Weeks
    Conway’s ZIP Proof, 393

Roger L. Kraft
    Chaos, Cantor Sets, and Hyperbolicity for the Logistic Maps, 400

Tom M. Apostol
    An Elementary View of Euler’s Summation Formula, 409

David B. Leep and Gerry Myerson
    Marriage, Magic, and Solitaire, 419

Hugh Howards, Michael Hutchings, and Frank Morgan
    The Isoperimetric Problem on Surfaces, 430

Timothy Y. Chow
    What Is a Closed-Form Number?, 440

NOTES

Greg Martin
    The Smallest Solution of φ(30n + 1) < φ(30n) Is..., 449

Frank K. Kenter
    A Matrix Representation for Euler’s Constant, γ, 452

H. Fejzić and D. Rinne
    More on a Mean Value Theorem Converse, 454

L. J. Lange
    An Elegant Continued Fraction for π, 456

Matthias Beck
    The Reciprocity Law for Dedekind Sums via the Constant Ehrhart Coefficient, 459

THE EVOLUTION OF...

Detlef Laugwitz
    Riemann’s Dissertation and Its Effect on the Evolution of Mathematics, 463

PROBLEMS AND SOLUTIONS, 470

REVIEWS

Dan Schnabel
    Life’s Other Secret. By Ian Stewart, 478
    The Magical Maze. By Ian Stewart, 478

Cynthia Woodburn
    An Introductory Course in Commutative Algebra. By A. W. Chatters and C. R. Hajarnavis, 481
    Introduction to Algebra. By Peter J. Cameron, 481

TELEGRAPHIC REVIEWS, 484


NOTICE TO AUTHORS 


The MONTHLY publishes articles, as well as notes and 
other features, about mathematics and the profes- 
sion. Its readers span a broad spectrum of mathe- 
matical interests, and include professional mathe- 
maticians as well as students of mathematics at all 
collegiate levels. Authors are invited to submit articles 
and notes that bring interesting mathematical ideas 
to a wide audience of MONTHLY readers. 


The MONTHLY’s readers expect a high standard of 
exposition; they expect articles to inform, stimulate, 
challenge, enlighten, and even entertain. MONTHLY 
articles are meant to be read, enjoyed, and dis- 
cussed, rather than just archived. Articles may be 
expositions of old or new results, historical or bio- 
graphical essays, speculations or definitive treat- 
ments, broad developments, or explorations of a sin- 
gle application. Novelty and generality are far less 
important than clarity of exposition and broad appeal. 
Appropriate figures, diagrams, and photographs are 
encouraged. 


Notes are short, sharply focussed, and possibly infor- 
mal. They are often gems that provide a new proof of 
an old theorem, a novel presentation of a familiar 
theme, or a lively discussion of a single issue. 


Articles and Notes should be sent to the Editor: 


ROGER A. HORN 

1515 Mineral Square, Room 142 
University of Utah 

Salt Lake City, UT 84112 


Please send your email address and 3 copies of the 
complete manuscript (including all figures with cap- 
tions and lettering), typewritten on only one side of 
the paper. In addition, send one original copy of all 
figures without lettering, drawn carefully in black ink 
on separate sheets of paper. Authors who use LaTeX
are urged to use article.sty and its standard environments
with no custom formatting.


Letters to the Editor on any topic are invited; please 
send to the MONTHLY’s Utah office. Comments, criti- 
cisms, and suggestions for making the MONTHLY 
more lively, entertaining, and informative are wel- 
come. 


See the MONTHLY section of MAA Online for current 
information such as contents of issues and descrip- 
tive summaries of forthcoming articles: 


http://www.maa.org/


Proposed problems or solutions should be sent to: 


DANIEL ULLMAN, MONTHLY Problems 
Department of Mathematics 

The George Washington University 
2201 G Street, NW, Room 428A 
Washington, DC 20052 


Please send 2 copies of all problems/solutions mate- 
rial, typewritten on only one side of the paper. 


EDITOR: ROGER A. HORN 
monthly@math.utah.edu 


ASSOCIATE EDITORS: 


WILLIAM ADKINS VICTOR KATZ 
DONNA BEERS STEVEN KRANTZ 
HAROLD BOAS JIMMIE LAWSON 
RICHARD BUMBY RICHARD NOWAKOWSKI 
JAMES CASE ARNOLD OSTEBEE 
JANE DAY KAREN PARSHALL 
JOHN DUNCAN EDWARD SCHEINERMAN 
PETER DUREN ABE SHENITZER 
GERALD EDGAR WALTER STROMQUIST 
JOHN EWING ALAN TUCKER 
JOSEPH GALLIAN DANIEL ULLMAN 
ROBERT GREENE DANIEL VELLEMAN
RICHARD GUY ANN WATKINS 
PAUL HALMOS DOUGLAS WEST 
GUERSHON HAREL HERBERT WILF 
DAVID HOAGLIN 
EDITORIAL ASSISTANTS: 

ARLEE CRAPO 

MEGAN TONKOVICH 


Reprint permission: 
DONALD ALBERS, Director of Publication 


Advertising Correspondence: 
Dave Riska, driska@maa.org 


Change of address, missing issues inquiries, and 
other subscription correspondence: 
MAA Service Center, maahq@maa.org 


All at the address: 


The Mathematical Association of America 
1529 Eighteenth Street, N.W. 
Washington, DC 20036 


Recent copies of the MONTHLY are available for purchase
through the MAA customer service center,
maahq@maa.org, 1-800-331-1622.


Microfilm Editions: University Microfilms International, 
Serial Bid coordinator, 300 North Zeeb Road, Ann 
Arbor, MI 48106. 


The AMERICAN MATHEMATICAL MONTHLY (ISSN 
0002-9890) is published monthly except bimonthly 
June-July and August-September by the Mathematical
Association of America at 1529 Eighteenth Street,
N.W., Washington, DC 20036 and Montpelier, VT.
Copyrighted by the Mathematical Association of 
America (Incorporated), 1999, including rights to this 
journal issue as a whole and, except where otherwise 
noted, rights to each individual contribution. ‘‘Permis- 
sion to make copies of individual articles, in paper or 
electronic form, including posting on personal and 
class web pages, for educational and scientific use is 
granted without fee provided that copies are not made 
or distributed for profit or commercial advantage and 
that copies bear the following copyright notice: 
[Copyright the Mathematical Association of America 
1999. All rights reserved.] Abstracting, with credit, is
permitted. To copy otherwise or to republish, re- 
quires specific permission of the MAA’s Director of 
Publication and possibly a fee.’’ Second class postage 
paid at Washington, DC, and additional mailing of- 
fices. Postmaster: Send address changes to the 
American Mathematical Monthly, Membership / 
Subscription Department, MAA, 1529 Eighteenth 
Street, N.W., Washington, DC, 20036-1385. 


Conway’s ZIP Proof 


George K. Francis and Jeffrey R. Weeks 


Surfaces arise naturally in many different forms, in branches of mathematics 
ranging from complex analysis to dynamical systems. The Classification Theorem, 
known since the 1860’s, asserts that all closed surfaces, despite their diverse origins 
and seemingly diverse forms, are topologically equivalent to spheres with some 
number of handles or crosscaps. The proofs found in most modern textbooks 
follow that of Seifert and Threlfall [5]. Seifert and Threlfall’s proof, while satisfy- 
ingly constructive, requires that a given surface be brought into a somewhat 
artificial standard form. Here we present a completely new proof, discovered by 
John H. Conway in about 1992, which retains the constructive nature of [5] while 
eliminating the irrelevancies of the standard form. Conway calls it his Zero 
Irrelevancy Proof, or “ZIP proof,” and asks that it always be called by this name, 
remarking that “otherwise there’s a real danger that its origin would be lost, since 
everyone who hears it immediately regards it as the obvious proof.” We trust that 
Conway’s ingenious proof will replace the customary textbook repetition of 
Seifert-Threlfall in favor of a lighter, fat-free nouvelle cuisine approach that retains 
all the classical flavor of elementary topology. 

We work in the realm of topology, where surfaces may be freely stretched and 
deformed. For example, a sphere and an ellipsoid are topologically equivalent, 
because one may be smoothly deformed into the other. But a sphere and a 
doughnut surface are topologically different, because no such deformation is 
possible. All of our figures depict deformations of surfaces. For example, the 
square with two holes in Figure 1A is topologically equivalent to the square with 
two tubes (1B), because one may be deformed to the other. More generally, two 
surfaces are considered equivalent, or homeomorphic, if and only if one may be 


Figure 1. Handle Figure 2. Crosshandle 


1999] CONWAY’S ZIP PROOF 393


mapped onto the other in a continuous, one-to-one fashion. That is, it’s the final 
equivalence that counts, whether or not it was obtained via a deformation. 

Let us introduce the primitive topological features in terms of zippers or 
“zip-pairs,” a zip being half a zipper. Figure 1A shows a surface with two boundary
circles, each with a zip. Zip the zips, and the surface acquires a handle (1D). If we
reverse the direction of one of the zips (2A), then one of the tubes must “pass
through itself” (2B) to get the zip orientations to match. Figure 2B shows the
self-intersecting tube with a vertical slice temporarily removed, so the reader may
see its structure more easily. Zipping the zips (2C) yields a crosshandle (2D). This
picture of a crosshandle contains a line of self-intersection. The self-intersection is 
an interesting feature of the surface’s placement in 3-dimensional space, but has 
no effect on the intrinsic topology of the surface itself. 

If the zips occupy two halves of a single boundary circle (Figure 3A), and their 
orientations are consistent, then we get a cap (3C), which is topologically
trivial (3D) and won’t be considered further. If the zip orientations are inconsis- 
tent (4A), the result is more interesting. We deform the surface so that correspond- 
ing points on the two zips lie opposite one another (4B), and begin zipping. At first 


Figure 3. Cap Figure 4. Crosscap 


the zipper head moves uneventfully upward (4C), but upon reaching the top it 
starts downward, zipping together the “other two sheets” and creating a line of 
self-intersection. As before, the self-intersection is merely an artifact of the 
model, and has no effect on the intrinsic topology of the surface. The result is a 
crosscap (4D), shown here with a cut-away view to make the self-intersections 
clearer. 

The preceding constructions should make the concept of a surface clear to 
non-specialists. Specialists may note that our surfaces are compact, and may have 
boundary. 


Comment. A surface is not assumed to be connected. 


Comment. Figure 5 shows an example of a triangulated surface. All surfaces may 
be triangulated, but the proof [4] is difficult. Instead we may consider the 
Classification Theorem to be a statement about surfaces that have already been 
triangulated. 




Figure 5. Install a zip-pair along each edge of the triangulation, unzip them all, and then re-zip them 
one at a time. 


Definition. A perforation is what’s left when you remove an open disk from a 
surface. For example, Figure 1A shows a portion of a surface with two perfora- 
tions. 


Definition. A surface is ordinary if it is homeomorphic to a finite collection of 
spheres, each with a finite number of handles, crosshandles, crosscaps, and 
perforations. 


Classification Theorem (preliminary version). Every surface is ordinary. 


Proof: Begin with an arbitrary triangulated surface. Imagine it as a patchwork 
quilt, only instead of imagining traditional square patches of material held together 
with stitching, imagine triangular patches held together with zip-pairs (Figure 5). 
Unzip all the zip-pairs, and the surface falls into a collection of triangles with zips 
along their edges. This collection of triangles is an ordinary surface, because each 
triangle is homeomorphic to a sphere with a single perforation. Now re-zip one zip 
to its original mate. It’s not hard to show that the resulting surface must again be 
ordinary, but for clarity we postpone the details to Lemma 1. Continue re-zipping 
the zips to their original mates, one pair at a time, noting that at each step 
Lemma 1 ensures that the surface remains ordinary. When the last zip-pair is 
zipped, the original surface is restored, and is seen to be ordinary. ∎


Lemma 1. Consider a surface with two zips attached to portions of its boundary. If the 
surface is ordinary before the zips are zipped together, it is ordinary afterwards as well. 


Proof: First consider the case that each of the two zips completely occupies a 
boundary circle. If the two boundary circles lie on the same connected component 
of the surface, then the surface may be deformed so that the boundary circles are 
adjacent to one another, and zipping them together converts them into either a 
handle (Figure 1) or a crosshandle (Figure 2), according to their relative orienta- 
tion. If the two boundary circles lie on different connected components, then 
zipping them together joins the two components into one. 




Next consider the case that the two zips share a single boundary circle, which 
they occupy completely. Zipping them together creates either a cap (Figure 3) or a 
crosscap (Figure 4), according to their relative orientation. 

Finally, consider the various cases in which the zips needn’t completely occupy 
their boundary circle(s), but may leave gaps. For example, zipping together the zips 
in Figure 6A converts two perforations into a handle with a perforation on 
top (6B). The perforation may then be slid free of the handle (6C, 6D). Returning 
to the general case of two zips that needn’t completely occupy their boundary 


Figure 6. These zips only partially occupy the boundary circles, so zipping them yields a handle with a 
puncture. 


circle(s), imagine that those two zips retain their normal size, while all other zips 
shrink to a size so small that we can’t see them with our eyeglasses off. This 
reduces us (with our eyeglasses still off!) to the case of two zips that do completely 
occupy their boundary circle(s), so we zip them and obtain a handle, crosshandle, 
cap, or crosscap, as illustrated in Figures 1-4. When we put our eyeglasses back 
on, we notice that the surface has small perforations as well, which we now restore 
to their original size. ∎


The following two lemmas express the relationships among handles, crosshandles,
and crosscaps.


Lemma 2. A crosshandle is homeomorphic to two crosscaps. 


Proof: Consider a surface with a “Klein perforation” (Figure 7A). If the parallel 
zips (shown with black arrows in 7A) are zipped first, the perforation splits in 
two (7B). Zipping the remaining zips yields a crosshandle (7C). 

If, on the other hand, the antiparallel zips (shown with white arrows in 
Figure 7A) are zipped first, we get a perforation with a “Möbius bridge” (7D).
Raising its boundary to a constant height, while letting the surface droop below it, 
yields the bottom half of a crosscap (7E). Temporarily fill in the top half of the 
crosscap with an “invisible disk” (7F), slide the disk free of the crosscap’s line of 




Figure 7. A crosshandle is homeomorphic to two crosscaps. 


self-intersection (7G), and then remove the temporary disk. Slide the perforation 
off the crosscap (7H) and zip the remaining zip-pair (shown with black arrows) to 
create a second crosscap (7I).

The intrinsic topology of the surface does not depend on which zip-pair is 
zipped first, so we conclude that the crosshandle (7C) is homeomorphic to two 
crosscaps (7I). ∎


Lemma 3 (Dyck’s Theorem [1]). Handles and crosshandles are equivalent in the 
presence of a crosscap. 


Proof: Consider a pair of perforations with zips installed as in Figure 8A. If, on the 
one hand, the black arrows are zipped first (8B), we get a handle along with 
instructions for a crosscap. If, on the other hand, one tube crosses through itself 
(8C, recall also Figure 2B) and the white arrows are zipped first, we get a 
crosshandle with instructions for a crosscap (8D). In both cases, of course, the 
crosscap may be slid free of the handle or crosshandle, just as the perforation was 
slid free of the handle in Figure 6BCD. Thus a handle-with-crosscap is homeomorphic
to a crosshandle-with-crosscap. ∎


Classification Theorem. Every connected closed surface is homeomorphic to either a 
sphere with crosscaps or a sphere with handles. 


Proof: By the preliminary version of the Classification Theorem, a connected 
closed surface is homeomorphic to a sphere with handles, crosshandles, and 
crosscaps. 

Case 1: At least one crosshandle or crosscap is present. Each crosshandle is 
homeomorphic to two crosscaps (Lemma 2), so the surface as a whole is homeo- 
morphic to a sphere with crosscaps and handles only. At least one crosscap 




Figure 8. The presence of a crosscap makes a handle cross. 


is present, so each handle is equivalent to a crosshandle (Lemma 3), which is in 
turn homeomorphic to two crosscaps (Lemma 2), resulting in a sphere with 
crosscaps only. 

Case 2: No crosshandle or crosscap is present. The surface is homeomorphic to 
a sphere with handles only. 

We have shown that every connected closed surface is homeomorphic to either 
a sphere with crosscaps or a sphere with handles. ∎
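The case analysis in this proof is mechanical enough to write down as a short procedure. The following Python sketch is only an illustration: the function name `normalize` and the tuple it returns are assumptions made here, not notation from the article. It takes the numbers of handles, crosshandles, and crosscaps on a connected closed surface and applies Lemma 2 and Lemma 3 in the order the proof does.

```python
def normalize(handles, crosshandles, crosscaps):
    """Reduce a sphere with the given features to the form named in the
    Classification Theorem: ("handles", n) or ("crosscaps", n)."""
    if min(handles, crosshandles, crosscaps) < 0:
        raise ValueError("feature counts must be non-negative")

    # Case 2: no crosshandle or crosscap is present, so only handles remain.
    if crosshandles == 0 and crosscaps == 0:
        return ("handles", handles)

    # Case 1, Lemma 2: each crosshandle is homeomorphic to two crosscaps.
    crosscaps += 2 * crosshandles

    # At least one crosscap is now present, so by Lemma 3 each handle is
    # equivalent to a crosshandle, hence (Lemma 2 again) to two crosscaps.
    crosscaps += 2 * handles

    return ("crosscaps", crosscaps)
```

For instance, `normalize(1, 0, 1)` returns `("crosscaps", 3)`: a handle in the presence of one crosscap becomes a sphere with three crosscaps.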


Comment. The surfaces named in the Classification Theorem are all topologically 
distinct, and may be recognized by their orientability and Euler number. A sphere 
with n handles is orientable with Euler number 2 − 2n, while a sphere with n
crosscaps is nonorientable with Euler number 2 − n. Most topology books provide
details; elementary introductions appear in [6] and [2]. 
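As a rough illustration of how these two invariants identify a surface, the Python sketch below computes the Euler number of a closed triangulated surface and inverts the two formulas above. It assumes the standard definition of the Euler number as vertices minus edges plus faces, which the article does not spell out; the names `euler_number`, `identify`, and `TETRAHEDRON` are hypothetical, and orientability is taken as an input rather than computed.

```python
def euler_number(triangles):
    """V - E + F for a closed surface given as a list of vertex triples."""
    vertices = {v for tri in triangles for v in tri}
    edges = {
        frozenset(edge)
        for a, b, c in triangles
        for edge in ((a, b), (b, c), (a, c))
    }
    return len(vertices) - len(edges) + len(triangles)


def identify(orientable, chi):
    """Invert chi = 2 - 2n (orientable) or chi = 2 - n (nonorientable)."""
    if orientable:
        n, remainder = divmod(2 - chi, 2)
        if n < 0 or remainder != 0:
            raise ValueError("no orientable closed surface has this Euler number")
        return ("handles", n)
    n = 2 - chi
    if n < 1:
        raise ValueError("no nonorientable closed surface has this Euler number")
    return ("crosscaps", n)


# The boundary of a tetrahedron: a triangulated sphere.
TETRAHEDRON = [(0, 1, 2), (0, 1, 3), (0, 2, 3), (1, 2, 3)]
```

With the tetrahedron, `euler_number(TETRAHEDRON)` is 4 − 6 + 4 = 2, and `identify(True, 2)` gives a sphere with no handles.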


Nomenclature. A sphere with one handle is a torus, a sphere with two handles is a 
double torus, with three handles a triple torus, and so on. A sphere with one 
crosscap has traditionally been called a real projective plane. That name is 
appropriate in the study of projective geometry, when an affine structure is 
present, but is inappropriate for a purely topological object. Instead, Conway 
proposes that a sphere with one crosscap be called a cross surface. The name cross 
surface evokes not only the crosscap, but also the surface’s elegant alternative 
construction as a sphere with antipodal points identified. A sphere with two 
crosscaps then becomes a double cross surface, with three crosscaps a triple cross 
surface, and so on. As special cases, the double cross surface is often called a Klein 
bottle, and the triple cross surface is often called Dyck’s surface [3]. 


REFERENCES 
1. W. Dyck, Beiträge zur Analysis situs I, Math. Ann. 32 (1888) 459-512.


2. D. Farmer and T. Stanford, Knots and Surfaces, American Mathematical Society, Providence, RI, 
1996. 




3. G. Francis and B. Collins, On knot-spanning surfaces: An illustrated essay on topological art. In 
Michele Emmer, editor, The Visual Mind: Art and Mathematics, chapter 11, MIT Press, Cambridge, 
MA, 1993. 

4. T. Radó, Über den Begriff der Riemannschen Fläche, Acta Litt. Sci. Szeged (1925), 101-121.

5. H. Seifert and W. Threlfall, Lehrbuch der Topologie. Teubner, Leipzig, 1934. Translated into 
English as A Textbook of Topology, Academic Press, New York, 1980. 

6. J. Weeks, The Shape of Space. Marcel Dekker, New York, 1985. 


GEORGE FRANCIS is professor of mathematics, professor in the Campus Honors Program, and senior 
research scientist at the National Center for Supercomputing Applications at the University of Illinois 
at Urbana-Champaign. Francis received his BSmcl from Notre Dame in 1958, an A.M. from Harvard in 
1960, and a Ph.D. in mathematics from the University of Michigan in 1967. His research fields are 
descriptive topology, geometrical computer graphics, and immersive virtual environments. A Topologi- 
cal Picturebook of Francis’ drawings by hand and computer has been translated into Japanese and 
Russian. 

University of Illinois at Urbana-Champaign, Urbana, IL 61801 

gfrancis@math.uiuc.edu 


JEFF WEEKS is an independent consultant living in Canton, NY. He has an A.B. from Dartmouth 
College and a Ph.D. from Princeton University, both in mathematics, and splits his time among 
research, education, and his family. Nominally a topologist and geometer, he has recently fallen in with 
a group of cosmologists hoping to determine the global topology of the universe from the cosmic 
microwave background radiation. 

15 Farmer Street, Canton, NY 13617 

weeks@northnet.org


A Problem From the MONTHLY 100 Years Ago 


114. Proposed by F. P. MATZ, M.Sc., Ph.D., Professor of Mathe- 
matics and Astronomy, Irving College, Mechanicsburg, Pa. 


Does it pay a $4-carpenter using a dozen four-penny nails per 
minute, to pick up a dropped nail? At this rate, should twenty 
penny nails be picked up? 


MONTHLY 6 (1899) 237 


Editor’s note: In the printed solution to the problem (by the
MONTHLY’s editor, Benjamin Finkel) one discovers that the carpen- 
ter was paid $4 per day and that four-penny nails cost 5 cents per 
pound. 




Chaos, Cantor Sets, and Hyperbolicity 
for the Logistic Maps 


Roger L. Kraft 


The family of logistic maps $f_\mu(x) = \mu x(1 - x)$ appears in almost every dynamical
systems textbook. It is one of the simplest nonlinear systems that one can study,
but it is amazingly rich in phenomena. It has a surprising number of connections to
other topics in dynamical systems and applied mathematics, for example, population
dynamics, symbolic dynamics, complex analytic dynamics, the Mandelbrot set,
the period-doubling route to chaos, renormalization, universality, homoclinic
bifurcations, horseshoes, and invariant measures. Because of its simplicity, many
introductory dynamical systems textbooks use it as a primary example, in particular
as the primary example of a chaotic dynamical system. When $\mu > 2 + \sqrt{5} \approx 4.236$,
it is not too difficult to prove that $f_\mu$ is chaotic on an invariant Cantor set; for the
details, see any of [1, pp. 31-50], [3, pp. 112-126], or [4, pp. 69-85]. Each of these
books states without proof that $f_\mu$ is actually chaotic for all $\mu > 4$. Our goal is to
give a simple proof of this fact.

As far as I know, only one textbook gives a proof that $f_\mu$ is chaotic for $\mu > 4$
[6, pp. 33-37]. However, its proof uses the Poincaré hyperbolic metric on the unit 
interval, the calculation of a derivative using different metrics, and the Schwarz 
Lemma from the theory of complex variables. While this proof is very elegant, and 
hints at the connections between the logistic maps and complex analytic dynamics, 
it is not in the spirit of the more elementary books. 

The family of logistic maps $f_\mu : \mathbb{R} \to \mathbb{R}$, $\mu > 0$, is a family of parabolas that
open downward, intercept the x-axis at 0 and 1, and have a maximum at 1/2. Since
the maximum value is $\mu/4$, $f_\mu$ maps the interval [0, 1] into [0, 1] when $0 < \mu \le 4$.
But when $\mu > 4$, there are points in [0, 1] that escape from [0, 1] under forward
iteration of $f_\mu$. Let

$$\Lambda_\mu = \bigcap_{n=0}^{\infty} f_\mu^{-n}([0, 1]).$$

For $\mu > 4$, $\Lambda_\mu$ contains exactly those points in [0, 1] that never escape under
forward iteration by $f_\mu$. Our main result is:


Theorem 1. If $\mu > 4$, then $\Lambda_\mu$ is a Cantor set, and the restriction of $f_\mu$ to $\Lambda_\mu$
is chaotic.
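The escape behaviour that defines the invariant set can be explored with a few lines of Python. This is a minimal sketch, not part of the article: the names `logistic` and `escape_time` and the cap `max_iter` are illustrative assumptions. Because the map amplifies rounding errors, most points of the invariant set will appear to escape in floating point, so the function is only a rough experiment and not a membership test.

```python
def logistic(mu, x):
    """The logistic map f_mu(x) = mu * x * (1 - x)."""
    return mu * x * (1.0 - x)


def escape_time(mu, x, max_iter=100):
    """Smallest n >= 0 with f_mu^n(x) outside [0, 1], or None if the orbit
    is still inside [0, 1] after max_iter iterations."""
    for n in range(max_iter + 1):
        if x < 0.0 or x > 1.0:
            return n
        x = logistic(mu, x)
    return None
```

With `mu = 4.5` the point 1/2 leaves immediately, since its image is 4.5/4 > 1, so `escape_time(4.5, 0.5)` is 1, while the fixed point 0 and its preimage 1 never escape.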


Once we have shown that $\Lambda_\mu$ is a Cantor set, the proof that the restriction of $f_\mu$
to $\Lambda_\mu$ is chaotic is the same as in the case $\mu > 2 + \sqrt{5}$: Use itineraries to construct a
topological conjugacy between $f_\mu$ on $\Lambda_\mu$ and the shift map $\sigma$ on $\Sigma_2 = \{0, 1\}^{\mathbb{N}}$; this
shows that $f_\mu$ on $\Lambda_\mu$ is topologically transitive and has dense periodic points. It
is easy to show that $f_\mu$ on $\Lambda_\mu$ has sensitive dependence on initial conditions; in
fact, it is easy to show that it is expansive, which is a stronger property [1, p. 50],
[6, p. 83]. All the details of these steps remain unchanged.


400 CHAOS, CANTOR SETS, AND HYPERBOLICITY FOR THE LOGISTIC MAPS [May 


Figure 1.


Choose a value of $\mu > 4$, and let us define some notation. Let $q_0 < q_1$ solve
$f_\mu(x) = 1$; see Figure 1. Let $I_0 = [0, 1]$, and let $I_1 = [0, q_0] \cup [q_1, 1]$. Notice that
$I_1 = f_\mu^{-1}(I_0)$. In general, let $I_n = f_\mu^{-n}(I_0) = f_\mu^{-1}(I_{n-1})$. Then $I_n$ is exactly those
points in [0, 1] that stay in [0, 1] for their first $n$ iterates under $f_\mu$, and $I_n \subset I_{n-1}$.
Notice also that $I_n$ is made up of $2^n$ disjoint closed intervals. Order the $2^n$
components of $I_n$ from left to right, and let $I_{n,j}$ denote the $j$-th component. (To
keep the notation as simple as possible, we suppress the explicit dependence of
objects like $I_n$ on the parameter $\mu$.) For more details about the definitions in this
paragraph, see [1, pp. 34-36], [3, pp. 112-114], [4, pp. 70-73], or [6, pp. 30-32].
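The construction in this paragraph can be carried out numerically. The sketch below assumes only the quadratic formula: solving $\mu x(1 - x) = y$ gives the two inverse branches $x = (1 \mp \sqrt{1 - 4y/\mu})/2$, and each step applies both branches to every interval of the previous level. The function names `left_branch` and `components` are hypothetical.

```python
import math


def left_branch(mu, y):
    """Left inverse branch of f_mu: the solution x <= 1/2 of mu*x*(1-x) = y."""
    return (1.0 - math.sqrt(1.0 - 4.0 * y / mu)) / 2.0


def components(mu, n):
    """Components of I_n = f_mu^{-n}([0, 1]) as (left, right) pairs, ordered
    from left to right. Requires mu > 4."""
    intervals = [(0.0, 1.0)]
    for _ in range(n):
        # The left branch is increasing, so it preserves the ordering.
        left = [(left_branch(mu, a), left_branch(mu, b)) for a, b in intervals]
        # The right branch is x -> 1 - left_branch(x): mirror and reverse.
        right = [(1.0 - b, 1.0 - a) for a, b in reversed(left)]
        intervals = left + right
    return intervals
```

For example, `components(4.5, 1)` is approximately `[(0, 1/3), (2/3, 1)]`, so $q_0 = 1/3$ and $q_1 = 2/3$ when $\mu = 4.5$, and `components(4.5, 5)` has $2^5 = 32$ disjoint intervals.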

If $I$ is an interval, we let $|I|$ denote the length of $I$.

Recall that a subset of the real line is a Cantor set if it is compact, perfect, and 
totally disconnected. Recall also that a subset of the real line is totally disconnected 
if and only if it does not contain any intervals; see [1, p. 37], [3, p. 116], [4, p. 73], or 
[6, p. 26]. 

The first step in proving that $\Lambda_\mu$ is a Cantor set is the following lemma.


Lemma 2. If $\mu > 4$, then $\Lambda_\mu$ is a compact perfect set.


Proof: Since $\Lambda_\mu = \bigcap_{n=0}^{\infty} I_n$ and each $I_n$ is compact, we know that $\Lambda_\mu$ is compact.
To show that $\Lambda_\mu$ is perfect, first notice that for every $n$, all the endpoints of $I_n$ are
contained in $\Lambda_\mu$. Let $x \in \Lambda_\mu$, and for each $n$ let $I_{n,j_n}$ denote the component of $I_n$
that contains $x$. If $|I_{n,j_n}| \to 0$ as $n \to \infty$, then there are endpoints from $I_{n,j_n}$
arbitrarily close to $x$, so $x$ is in the closure of $\Lambda_\mu \setminus \{x\}$. On the other hand, if
$|I_{n,j_n}|$ does not go to 0 as $n \to \infty$, then $\bigcap_{n=0}^{\infty} I_{n,j_n}$ is a closed interval, and
$x \in \bigcap_{n=0}^{\infty} I_{n,j_n} \subset \Lambda_\mu$, so once again $x$ is in the closure of $\Lambda_\mu \setminus \{x\}$. This shows that
$\Lambda_\mu$ is perfect. ∎


To finish the proof that $\Lambda_\mu$ is a Cantor set, we need to show that it does not
contain any intervals. How is this done when $\mu > 2 + \sqrt{5}$? A simple calculation
shows that when $\mu = 2 + \sqrt{5}$, $|f_\mu'(q_0)| = 1$. So if $\mu > 2 + \sqrt{5}$, then $|f_\mu'(x)| > 1$
for all $x \in I_1$. This key fact makes the case $\mu > 2 + \sqrt{5}$ straightforward, as we
now show.




Lemma 3. If $\mu > 2 + \sqrt{5}$, then $\Lambda_\mu$ is a Cantor set.


Proof: Suppose that $\Lambda_\mu$ contains an interval; let $[a, b] \subset \Lambda_\mu$. For every $n \ge 1$, the
Mean Value Theorem applied to $f_\mu^n$ on the interval $[a, b]$ ensures that there is a
point $c_n \in (a, b)$ such that

$$|f_\mu^n(b) - f_\mu^n(a)| = |(f_\mu^n)'(c_n)| \, |b - a|.$$

Let $\lambda = f_\mu'(q_0)$, so $|f_\mu'(x)| \ge \lambda$ for all $x \in I_1$. Since $\mu > 2 + \sqrt{5}$, we have $\lambda > 1$.
Since $c_n \in [a, b] \subset \Lambda_\mu$, we have $f_\mu^i(c_n) \in \Lambda_\mu \subset I_1$ for all $0 \le i \le n - 1$. Therefore,
$|(f_\mu^n)'(c_n)| \ge \lambda^n$ by the chain rule, and

$$|f_\mu^n(b) - f_\mu^n(a)| = |(f_\mu^n)'(c_n)| \, |b - a| \ge \lambda^n |b - a|.$$

Since $\lambda > 1$, this implies that $|f_\mu^n(b) - f_\mu^n(a)| > 1$ for all $n$ sufficiently large. But
$\Lambda_\mu$ is invariant, and hence $\{f_\mu^n(a), f_\mu^n(b)\} \subset \Lambda_\mu \subset [0, 1]$ for all $n$, so we have a
contradiction. Thus $\Lambda_\mu$ does not contain any intervals, and hence is a Cantor set. ∎


Here is another way to think about this proof: If $\mu > 2 + \sqrt{5}$, and we apply $f_\mu^{-1}$
to $I_n$ to get $I_{n+1}$, then $f_\mu^{-1}$ shrinks the length of every component of $I_n$ by at least
the amount $\lambda^{-1} < 1$, so the lengths of the components of $I_n$ go to zero as $n$ goes
to infinity.
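The threshold $2 + \sqrt{5}$ can be checked directly. Solving $\mu x(1 - x) = 1$ gives $q_0 = (1 - \sqrt{1 - 4/\mu})/2$, and since $f_\mu'(x) = \mu(1 - 2x)$, the derivative at $q_0$ works out to $\sqrt{\mu^2 - 4\mu}$, which equals 1 exactly when $\mu = 2 + \sqrt{5}$. The Python sketch below confirms this numerically; the function names `q0` and `fprime` are chosen here for illustration.

```python
import math


def q0(mu):
    """Smaller solution of mu * x * (1 - x) = 1; requires mu > 4."""
    return (1.0 - math.sqrt(1.0 - 4.0 / mu)) / 2.0


def fprime(mu, x):
    """Derivative of the logistic map: f_mu'(x) = mu * (1 - 2x)."""
    return mu * (1.0 - 2.0 * x)


THRESHOLD = 2.0 + math.sqrt(5.0)

# lambda = f_mu'(q0) is the contraction constant used in the proof of Lemma 3.
lam_at_threshold = fprime(THRESHOLD, q0(THRESHOLD))  # approximately 1
lam_above = fprime(4.5, q0(4.5))                     # approximately 1.5
lam_below = fprime(4.1, q0(4.1))                     # approximately 0.64
```

So for $\mu = 4.5$ every application of $f_\mu^{-1}$ shrinks components by a factor of at least 2/3, while for $\mu = 4.1$ the derivative at $q_0$ is below 1 and the argument of Lemma 3 no longer applies.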

When $\mu \in (4, 2 + \sqrt{5}]$, we have $|f_\mu'(x)| > 1$ for some $x \in I_1$, but $|f_\mu'(x)| \le 1$
for other $x \in I_1$. When we apply $f_\mu^{-1}$ to $I_n$ to get $I_{n+1}$, $f_\mu^{-1}$ shrinks some
components of $I_n$, but, in contrast to the case when $\mu > 2 + \sqrt{5}$, $f_\mu^{-1}$ may also
stretch other components of $I_n$. This combination of shrinking and stretching by
$f_\mu^{-1}$ is what makes it difficult to show that $\Lambda_\mu$ is a Cantor set when $4 < \mu \le 2 + \sqrt{5}$.
However, a little playing around with $f_\mu$ should give one the sense that somehow,
the stretching is eventually dominated by the shrinking as we repeatedly apply $f_\mu^{-1}$.
This leads to the following important definition.


Definition. Let $f : \mathbb{R} \to \mathbb{R}$ be a $C^1$ function, and suppose that $\Lambda$ is a compact
invariant set for $f$ (i.e., $f(\Lambda) = \Lambda$). Then $\Lambda$ is a hyperbolic set for $f$ if there are
constants $C > 0$ and $\lambda > 1$ such that $|(f^n)'(x)| \ge C\lambda^n$ for all $x \in \Lambda$ and all
$n \ge 1$.


The $C$ in the definition takes care of the fact that $f^{-1}$ may stretch some
intervals (i.e., $|f'(x)| < 1$ for some $x \in \Lambda$), in which case $C < 1$, but $\lambda > 1$ implies
that shrinking under $f^{-n}$ eventually dominates any stretching when $C\lambda^n > 1$; see
[6, pp. 107-108 and p. 156].
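The contrast between pure shrinking and mixed shrinking and stretching can be seen in a small experiment. The Python sketch below is an illustration under stated assumptions: it builds the components of $I_n$ from the inverse branches $x = (1 \mp \sqrt{1 - 4y/\mu})/2$ of the logistic map and reports, for each component $J$ of $I_n$, the ratio $|f_\mu^{-1}(J)| / |J|$ for one branch (the two branches give the same length by symmetry). The names `left_branch`, `components`, and `length_ratios` are hypothetical.

```python
import math


def left_branch(mu, y):
    """Left inverse branch of f_mu: the solution x <= 1/2 of mu*x*(1-x) = y."""
    return (1.0 - math.sqrt(1.0 - 4.0 * y / mu)) / 2.0


def components(mu, n):
    """Components of I_n = f_mu^{-n}([0, 1]), ordered left to right (mu > 4)."""
    intervals = [(0.0, 1.0)]
    for _ in range(n):
        left = [(left_branch(mu, a), left_branch(mu, b)) for a, b in intervals]
        right = [(1.0 - b, 1.0 - a) for a, b in reversed(left)]
        intervals = left + right
    return intervals


def length_ratios(mu, n):
    """For each component J of I_n, the ratio |g(J)| / |J|, where g is the
    left inverse branch of f_mu. A ratio above 1 means J is stretched."""
    return [
        (left_branch(mu, b) - left_branch(mu, a)) / (b - a)
        for a, b in components(mu, n)
    ]
```

For $\mu = 4.5$ every ratio returned by `length_ratios(4.5, 4)` is below 1, whereas for $\mu = 4.1$ the list `length_ratios(4.1, 4)` contains ratios on both sides of 1: the component of $I_4$ adjacent to 1 is stretched when pulled back, while the component adjacent to 0 is shrunk.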

The following lemma gives some insight into the definition of hyperbolicity, and 
makes it easier to use. 


Lemma 4. Let $f : \mathbb{R} \to \mathbb{R}$ be a $C^1$ function, and suppose that $\Lambda$ is a compact
invariant set for $f$. Then the following are equivalent.


(1) There are constants $C > 0$ and $\lambda > 1$ such that $|(f^n)'(x)| \ge C\lambda^n$ for all
$x \in \Lambda$ and all $n \ge 1$.

(2) There is an integer $N \ge 1$ such that $|(f^n)'(x)| > 1$ for all $x \in \Lambda$ and all
$n \ge N$.

(3) There is an integer $n_0 \ge 1$ such that $|(f^{n_0})'(x)| > 1$ for all $x \in \Lambda$.

(4) For every $x \in \Lambda$ there is an integer $n_x \ge 1$, which may depend on $x$, such that
$|(f^{n_x})'(x)| > 1$.




Remark. If $|f'(x)| > 1$ for all $x \in \Lambda$, then it is obvious that all four of the
conditions in the lemma are true. This emphasizes once again that it is the
possibility that $|f'(x)| < 1$ for some $x \in \Lambda$ that makes the definition of hyperbolicity
subtle.
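Condition (3) can be probed numerically for the logistic map. The sketch below is an experiment, not a proof, and rests on assumptions made here: it samples the invariant set through the endpoints of the components of $I_n$, that is, the points that reach 0 or 1 within `depth` iterations (these endpoints always lie in the invariant set), and it takes the minimum of $|(f_\mu^n)'|$ over that finite sample. The names `endpoints`, `deriv_iterate`, and `min_abs_derivative` are hypothetical.

```python
import math


def endpoints(mu, depth):
    """Points x in [0, 1] with f_mu^depth(x) in {0, 1}; requires mu > 4."""
    pts = {0.0, 1.0}
    for _ in range(depth):
        new = set()
        for y in pts:
            r = math.sqrt(1.0 - 4.0 * y / mu)
            new.update(((1.0 - r) / 2.0, (1.0 + r) / 2.0))
        pts = new
    return sorted(pts)


def deriv_iterate(mu, x, n):
    """(f_mu^n)'(x), computed with the chain rule along the orbit of x."""
    d = 1.0
    for _ in range(n):
        d *= mu * (1.0 - 2.0 * x)
        x = mu * x * (1.0 - x)
    return d


def min_abs_derivative(mu, n, depth=8):
    """Minimum of |(f_mu^n)'| over the sampled endpoints."""
    return min(abs(deriv_iterate(mu, x, n)) for x in endpoints(mu, depth))
```

For $\mu = 4.1$ the first derivative dips below 1 on the sample (the minimum is $\sqrt{0.41} \approx 0.64$, attained at $q_0$ and $q_1$), but the derivative of the second iterate stays above 1 (roughly 1.73), which is consistent with condition (3) holding with $n_0 = 2$ for this parameter.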


Proof: (4) ⇒ (3) [5, p. 220] Since $f$ is $C^1$, $(f^n)'$ is continuous for every $n$. For each
$x \in \Lambda$, $|(f^{n_x})'(x)| > 1$ and $(f^{n_x})'$ is continuous, so there is a neighborhood $U_x$ of $x$
and a $\lambda_x > 1$ such that $|(f^{n_x})'(y)| > \lambda_x$ for all $y \in U_x$. The open sets $\{U_x \mid x \in \Lambda\}$
cover the compact set $\Lambda$, so there is a finite subcover $\{U_i\}_{i=1}^{\ell}$, numbers $\{\lambda_i\}_{i=1}^{\ell}$
all strictly greater than 1, and integers $\{n_i\}_{i=1}^{\ell}$ such that $|(f^{n_i})'(y)| > \lambda_i$ for all
$y \in U_i$. Let

$$\nu = \max\{n_1, \ldots, n_\ell\}, \quad \lambda_0 = \min\{\lambda_1, \ldots, \lambda_\ell\}, \quad \text{and} \quad m = \min_{x \in \Lambda} \{|f'(x)|\},$$

so $m > 0$ (why?). Choose an integer $k$ so that $\lambda_0^k m^{\nu} > 1$, and let $n_0 = k\nu + \nu$.
Now that we have defined our global choice for $n_0$, we need to show that
$|(f^{n_0})'(x)| > 1$ for all $x \in \Lambda$. If we imagine that $\lambda_0$ represents “good” derivatives
($\lambda_0 > 1$) and $m$ represents “bad” derivatives ($m < 1$), then we need to show that
$f^{n_0}(x)$ contains at least $k$ iterates with good derivatives to compensate for the
worst case of $\nu$ iterates with bad derivatives.

Choose $x \in \Lambda$ and perform the following selection process that depends on $x$
and terminates after a finite number of steps:

Choose $\nu_1$ so that $x \in U_{\nu_1}$. Now suppose that we are given $\{\nu_1, \ldots, \nu_j\}$. Let
$n = \sum_{i=1}^{j} n_{\nu_i}$. Choose $\nu_{j+1}$ so that $f^n(x) \in U_{\nu_{j+1}}$. If $n + n_{\nu_{j+1}} > k\nu$, then stop;
otherwise, go on to choose $\nu_{j+2}$.

If the selection process stops after $j$ steps, then $k\nu < \sum_{i=1}^{j} n_{\nu_i} \le k\nu + \nu$. Write
$n_0 = n_{\nu_1} + n_{\nu_2} + \cdots + n_{\nu_j} + i_0$, where $0 \le i_0 < \nu$. Each $n_{\nu_i}$ represents a good
iterate (derivative > 1), $j$ represents how many good iterates we actually have,
and $i_0$ represents how many bad iterates (derivatives < 1) we actually have. Since
each $n_{\nu_i} \le \nu$, we know that $j > k$. Using the chain rule, we can estimate $|(f^{n_0})'(x)|$:


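$$
\begin{aligned}
|(f^{n_0})'(x)| &= |(f^{i_0} \circ f^{n_{\nu_j}} \circ \cdots \circ f^{n_{\nu_1}})'(x)| \\
&\ge m^{i_0} \lambda_{\nu_j} \cdots \lambda_{\nu_1} && \text{(by the properties of the subcover)} \\
&\ge m^{\nu} \lambda_{\nu_j} \cdots \lambda_{\nu_1} && \text{(since } m < 1 \text{ and } i_0 < \nu\text{)} \\
&\ge m^{\nu} \lambda_0^{j} && \text{(by our choice of } \lambda_0\text{)} \\
&\ge m^{\nu} \lambda_0^{k} && \text{(since } j \ge k \text{ and } \lambda_0 > 1\text{)} \\
&> 1 && \text{(because of our choice of } k\text{).}
\end{aligned}
$$

Thus, $|(f^{n_0})'(x)| > 1$ for all $x \in \Lambda$, which proves (3).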


(3) ⇒ (2) [1, p. 99] If $n_0 = 1$ in (3), there is nothing to prove, so suppose
$n_0 > 1$. Let

$$\lambda = \min_{x \in \Lambda} \{|(f^{n_0})'(x)|\} \quad \text{and} \quad m = \min_{x \in \Lambda} \{|f'(x)|\},$$

so $\lambda > 1$ (why?) and $m > 0$. Since we are assuming that $n_0 > 1$, we must have
$m < 1$. Choose $k$ so that $\lambda^k m^{n_0 - 1} > 1$. Let $N = n_0 k + (n_0 - 1)$. If $n \ge N$, write




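$n = n_0(k + \nu) + i$, where $\nu \ge 0$ and $0 \le i \le n_0 - 1$. Then for any $x \in \Lambda$ we have

$$
\begin{aligned}
|(f^n)'(x)| &= |(f^{n_0(k+\nu)})'(f^i(x)) \, (f^i)'(x)| \\
&\ge \lambda^{k+\nu} m^{i} \\
&\ge \lambda^{k+\nu} m^{n_0 - 1} && \text{(since } m < 1 \text{ and } i \le n_0 - 1\text{)} \\
&> \lambda^{\nu} && \text{(by our choice of } k\text{)} \\
&\ge 1 && \text{(since } \lambda > 1\text{).}
\end{aligned}
$$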


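(2) ⇒ (1) If $N = 1$ in (2), there is nothing to prove, so suppose $N > 1$. Let

$$m_N = \min_{x \in \Lambda} \{|(f^N)'(x)|\} \quad \text{and} \quad m = \min_{x \in \Lambda} \{|f'(x)|\},$$

so $m_N > 1$. Since we are assuming that $N > 1$, we must have $m < 1$. Let

$$\lambda = m_N^{1/N} \quad \text{and} \quad C = (m/\lambda)^{N-1},$$

so $\lambda > 1$ and $C > 0$. For any $n > 0$ write $n = kN + i$, where $k \ge 0$ and $0 \le i \le N - 1$.
Then for any $x \in \Lambda$ we have

$$
\begin{aligned}
|(f^n)'(x)| &= |(f^{kN})'(f^i(x)) \, (f^i)'(x)| \\
&\ge m_N^{k} m^{i} \\
&= \lambda^{kN} m^{i} && \text{(by our choice of } \lambda\text{)} \\
&= \lambda^{kN+i} (m/\lambda)^{i} \\
&\ge \lambda^{kN+i} (m/\lambda)^{N-1} && \text{(since } m/\lambda < 1 \text{ and } i \le N - 1\text{)} \\
&= C\lambda^{n} && \text{(by our choice of } C\text{).}
\end{aligned}
$$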


(1) ⇒ (4) Choose $n$ large enough so that $C\lambda^{n} > 1$. Then we have $|(f^{n})'(x)| \ge C\lambda^{n} > 1$. Now let $n_x = n$ for every $x \in \Lambda$. ∎


Why so many versions of the definition of hyperbolic? When we want to prove 
that a set is hyperbolic, it helps to use the weakest version of the definition, (4). On 
the other hand, when we want to prove general conclusions about a hyperbolic set, 
then it helps to use the strongest version, (1). Also, (2) is used as the definition of a 
hyperbolic set in some textbooks when the emphasis is on dynamics in one 
dimension, e.g., [1, p. 38], or [4, p. 77]. But a generalization of (1) is used in the
definition of hyperbolicity for higher dimensions, e.g., [6, p. 241]. 

When $\mu > 2 + \sqrt{5}$, we have $|f_\mu'(x)| > 1$ for all $x \in I_1$, and this is the key to proving that $\Lambda_\mu$ is a Cantor set. To prove that $\Lambda_\mu$ is a Cantor set when $\mu > 4$, we need to replace “$|f_\mu'(x)| > 1$ for all $x \in I_1$” with “$\Lambda_\mu$ is a hyperbolic set for $f_\mu$.” Before we can begin the proof of hyperbolicity, we need to introduce an important tool, the Schwarzian derivative.


Definition. The Schwarzian derivative of a $C^3$ function $f$ at a point $x$ where $f'(x) \ne 0$ is

$$Sf(x) = \frac{f'''(x)}{f'(x)} - \frac{3}{2}\left(\frac{f''(x)}{f'(x)}\right)^{2}.$$




This strange definition turns out to be tremendously useful. Our first result 
about the Schwarzian derivative is the following lemma. 


Lemma 5. For the logistic family $f_\mu$, with $\mu > 0$,

(1) $Sf_\mu(x) < 0$ for all $x \in \mathbb{R} \setminus \{1/2\}$,
(2) $Sf_\mu^{n}(x) < 0$ for all $n \ge 1$ and all $x \in \mathbb{R} \setminus \bigcup_{i=0}^{n-1} f_\mu^{-i}(1/2)$.

The first item is easy, since $f_\mu''' = 0$. However, the second is not so obvious, since $f_\mu^{n}$ is a polynomial of degree $2^{n}$. The second item follows from the first item, the following lemma, and induction. This “hereditary” result is one of the reasons the Schwarzian derivative is so useful.


Lemma 6. If $g'(x) \ne 0$ and $f'(g(x)) \ne 0$, then

$$S(f \circ g)(x) = Sf(g(x)) \cdot (g'(x))^{2} + Sg(x).$$

So if $Sg(x) < 0$ and $Sf(g(x)) < 0$, then $S(f \circ g)(x) < 0$.

Proof: The chain rule gives

$$
\begin{aligned}
(f \circ g)'(x) &= f'(g(x))\,g'(x), \\
(f \circ g)''(x) &= f''(g(x))\,(g'(x))^{2} + f'(g(x))\,g''(x), \text{ and} \\
(f \circ g)'''(x) &= f'''(g(x))\,(g'(x))^{3} + 3 f''(g(x))\,g''(x)\,g'(x) + f'(g(x))\,g'''(x).
\end{aligned}
$$

A computation now gives the desired result. ∎
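The composition rule of Lemma 6 can be checked exactly on a concrete case. The sketch below is only an illustration, not part of the original argument: it represents polynomials as coefficient lists over the rationals, takes $f = g = f_\mu$ with $f_\mu(x) = \mu x(1-x)$, and compares both sides of the identity at one point. All helper names (`poly_eval`, `poly_compose`, `schwarzian`, `composition_rule_holds`) are hypothetical.

```python
from fractions import Fraction


def poly_eval(p, x):
    """Evaluate p[0] + p[1]*x + p[2]*x**2 + ... by Horner's rule."""
    result = Fraction(0)
    for c in reversed(p):
        result = result * x + c
    return result


def poly_diff(p):
    """Coefficient list of the derivative (the zero polynomial is [0])."""
    return [k * c for k, c in enumerate(p)][1:] or [Fraction(0)]


def poly_mul(p, q):
    """Coefficient list of the product p*q."""
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out


def poly_compose(p, q):
    """Coefficient list of p(q(x)), built by Horner's rule on polynomials."""
    result = [Fraction(0)]
    for c in reversed(p):
        result = poly_mul(result, q)
        result[0] += c
    return result


def schwarzian(p, x):
    """Sf(x) = f'''/f' - (3/2)(f''/f')^2; requires f'(x) != 0."""
    d1 = poly_diff(p)
    d2 = poly_diff(d1)
    d3 = poly_diff(d2)
    f1, f2, f3 = (poly_eval(d, x) for d in (d1, d2, d3))
    return f3 / f1 - Fraction(3, 2) * (f2 / f1) ** 2


def logistic(mu):
    """Coefficients of f_mu(x) = mu*x - mu*x**2."""
    return [Fraction(0), Fraction(mu), -Fraction(mu)]


def composition_rule_holds(mu, x):
    """Check S(f o f)(x) == Sf(f(x)) * f'(x)**2 + Sf(x) for f = f_mu."""
    f = logistic(mu)
    lhs = schwarzian(poly_compose(f, f), x)
    g_x = poly_eval(f, x)
    rhs = schwarzian(f, g_x) * poly_eval(poly_diff(f), x) ** 2 + schwarzian(f, x)
    return lhs == rhs


if __name__ == "__main__":
    mu, x = Fraction(9, 2), Fraction(1, 3)
    print(schwarzian(logistic(mu), x))      # Schwarzian of f_mu at x
    print(composition_rule_holds(mu, x))    # both sides of Lemma 6 agree
```

Because the arithmetic is done with `Fraction`, the comparison is an exact equality rather than a floating-point approximation; the point $x$ must avoid $1/2$ and the preimages of $1/2$, where the Schwarzian derivative is undefined.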


We say that a function $f$ has negative Schwarzian derivative on an interval $I$ if $f'(x) \ne 0$ and $Sf(x) < 0$ for all $x \in I$; we abbreviate this as $Sf < 0$ on $I$. The following lemma gives a geometric consequence of negative Schwarzian derivative.


Lemma 7. If $I$ is an open interval and $Sf < 0$ on $I$, then $f'$ cannot have a positive local minimum on $I$, nor can it have a negative local maximum.


Proof: Suppose that $x$ is a positive local minimum point for $f'$ on $I$. Then $f'(x) > 0$, $f''(x) = 0$, and $f'''(x) \ge 0$ (why?). This implies that $Sf(x) \ge 0$, which contradicts $Sf < 0$ on $I$.

Similarly, if $x$ is a negative local maximum point for $f'$ on $I$, then $f'(x) < 0$, $f''(x) = 0$, and $f'''(x) \le 0$. This implies $Sf(x) \ge 0$, which contradicts $Sf < 0$ on $I$. ∎


Lemma 8 (Minimum Principle). Let $I = [a, b]$ and suppose $f$ is $C^3$ on $I$. If $Sf < 0$ on $(a, b)$, then $|f'(x)| > \min\{|f'(a)|, |f'(b)|\}$ for all $x \in (a, b)$.


Proof: Since $|f'|$ is continuous on the closed interval $I$, it must have a minimum at some point $x_0 \in I$. If $x_0 \in (a, b)$, then $f'(x_0) \ne 0$ since $Sf < 0$ on $(a, b)$. If $f'(x_0) > 0$, then $f'$ has a positive local minimum on $(a, b)$, which contradicts Lemma 7. On the other hand, if $f'(x_0) < 0$, then $f'$ has a negative local maximum on $(a, b)$, another contradiction of Lemma 7. It follows that $x_0 = a$ or $x_0 = b$. ∎


The Minimum Principle is the key result we need about negative Schwarzian derivatives. When we apply it to the iterates $f_\mu^{n}$, we get information about the shape of the graph of $f_\mu^{n}$ between its critical points that would be very difficult to get in any other way.

There are other important consequences for one-dimensional dynamical systems 
of negative Schwarzian derivative; see [1, Section 1.11]. 

Now consider $f_\mu$. For any $\mu > 0$, $f_\mu$ has a fixed point at $p_1 = 1 - (1/\mu)$, and $f_\mu'(p_1) = 2 - \mu$, so $|f_\mu'(p_1)| > 1$ when $\mu > 3$. Let $p_0 = 1/\mu$, so $p_0$ and $p_1$ are symmetric about $1/2$; see Figure 2. Notice that $f_\mu(p_0) = p_1$ (when $\mu > 2$), and that $f_\mu([p_0, q_0]) = f_\mu([q_1, p_1]) = [p_1, 1]$. So if we let $J = (p_0, q_0) \cup (q_1, p_1)$, and if $x$ is any point in $J$, then $f_\mu(x) \notin J$. But we have the following “return lemma.”


Figure 2. [Points marked on the unit interval, in order: $0$, $p_0$, $q_0$, $q_1$, $p_1$, $1$.]


Lemma 9 (Return Lemma). If $\mu > 4$ and if $x \in J$, then there is an integer $n \ge 2$ such that $f_\mu^{n}(x) \in [p_0, p_1)$.


Proof: Choose $x \in J$, so $f_\mu(x) \in (p_1, 1)$ and $f_\mu^{2}(x) \in (0, p_1)$. If $f_\mu^{2}(x) \in [p_0, p_1)$, then we are done. Suppose that $f_\mu^{2}(x) \in (0, p_0)$. We claim that for some $n \ge 1$, $f_\mu^{2+n}(x)$ is in $[p_0, p_1)$. Suppose not. Since $f_\mu(z) > z$ for all $z \in (0, p_1)$, we know that $f_\mu^{2+n}(x)$ is an increasing sequence bounded from above by $p_0$. So $f_\mu^{2+n}(x)$ converges as $n \to \infty$ to some point $z_0 \le p_0$. It follows that $z_0$ is a fixed point for $f_\mu$. But $0 < z_0 < p_1$, so we have a contradiction. ∎
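The Return Lemma is easy to watch numerically. The following sketch is an illustration under stated assumptions rather than part of the proof: it iterates $f_\mu(x) = \mu x(1-x)$ in floating point and reports the first $n \ge 2$ with $f_\mu^{n}(x) \in [p_0, p_1)$. The function names and the iteration cap `max_iter` are hypothetical, and $q_0$ is taken to be the smaller solution of $f_\mu(x) = 1$.

```python
import math


def logistic(mu, x):
    """The logistic map f_mu(x) = mu * x * (1 - x)."""
    return mu * x * (1 - x)


def return_time(mu, x, max_iter=10_000):
    """First n >= 2 with f_mu^n(x) in [p0, p1), or None if not found."""
    p0, p1 = 1 / mu, 1 - 1 / mu
    y = x
    for n in range(1, max_iter + 1):
        y = logistic(mu, y)
        if n >= 2 and p0 <= y < p1:
            return n
    return None


def left_part_of_J(mu, samples=99):
    """Sample points of (p0, q0), where q0 is the smaller root of f_mu(x) = 1."""
    p0 = 1 / mu
    q0 = 0.5 - 0.5 * math.sqrt(1 - 4 / mu)
    return [p0 + (q0 - p0) * t / (samples + 1) for t in range(1, samples + 1)]


if __name__ == "__main__":
    mu = 5.0
    for x in (0.25, 0.27, 0.276):
        print(x, return_time(mu, x))
```

Points of $J$ close to $q_0$ land close to $1$, then close to $0$, and need several further iterates to climb back into $[p_0, p_1)$, which is exactly the increasing sequence used in the proof.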


Lemma 10. If $\mu > 4$, then $q_0 - p_0 < p_0$. So the intervals $(p_0, q_0)$ and $(q_1, p_1)$ are shorter than the intervals $(0, p_0)$ and $(p_1, 1)$.


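Proof: We need to show that $\mu > 4$ implies $2p_0 > q_0$. Recall that

$$p_0 = \frac{1}{\mu} \quad\text{and}\quad q_0 = \frac{1}{2} - \frac{1}{2}\sqrt{1 - \frac{4}{\mu}}.$$

Since $\mu > 4$, we have $0 < 1 - (4/\mu) < 1$, so $\sqrt{1 - (4/\mu)} > 1 - (4/\mu)$. After multiplying both sides by $1/2$, we have

$$\frac{1}{2}\sqrt{1 - \frac{4}{\mu}} > \frac{1}{2} - \frac{2}{\mu}, \quad\text{or}\quad \frac{2}{\mu} > \frac{1}{2} - \frac{1}{2}\sqrt{1 - \frac{4}{\mu}}.$$

∎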
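As a numerical companion to Lemma 10, the sketch below evaluates $p_0$ and $q_0$ for a few parameter values and compares $q_0 - p_0$ with $p_0$. It assumes that $q_0$ is the smaller solution of $f_\mu(x) = 1$, that is, $q_0 = \tfrac12 - \tfrac12\sqrt{1 - 4/\mu}$; the function names are hypothetical.

```python
import math


def p0(mu):
    """p0 = 1/mu, the preimage of the fixed point p1 = 1 - 1/mu."""
    return 1 / mu


def q0(mu):
    """Smaller solution of mu * x * (1 - x) = 1 (assumed definition of q0)."""
    return 0.5 - 0.5 * math.sqrt(1 - 4 / mu)


def lemma_10_holds(mu):
    """True when the interval (p0, q0) is shorter than (0, p0)."""
    return q0(mu) - p0(mu) < p0(mu)


if __name__ == "__main__":
    for mu in (4.01, 5.0, 10.0, 100.0):
        print(mu, p0(mu), q0(mu), lemma_10_holds(mu))
```

The gap $2p_0 - q_0$ shrinks as $\mu$ decreases toward 4, where $q_0$ approaches $1/2$ and $2p_0$ approaches $1/2$ as well, so the inequality is tightest near the boundary of the parameter range.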


Now we have all the ingredients we need to prove hyperbolicity.

Theorem 11. If $\mu > 4$, then $\Lambda_\mu$ is a hyperbolic set for $f_\mu$.


Proof: Let $x \in \Lambda_\mu$. Assume that $x > 1/2$; the case where $x < 1/2$ follows by the symmetry of $f_\mu$ about $1/2$. We need to find an integer $n$ (which may depend on $x$) such that $|(f_\mu^{n})'(x)| > 1$. The hyperbolicity of $\Lambda_\mu$ then follows from Lemma 4.

If $x \ge p_1$, we can let $n = 1$ (why?). If $x = q_1$, then $f_\mu^{n}(q_1) = 0$ for $n \ge 2$, so

$$|(f_\mu^{n})'(q_1)| = |f_\mu'(q_1)|\,|f_\mu'(1)|\,|f_\mu'(0)|^{n-2} = \mu^{n-1}\sqrt{\mu^{2} - 4\mu} = \mu^{n}\sqrt{1 - (4/\mu)},$$

which is strictly greater than 1 for all $n$ sufficiently large.

Now concentrate on $x$ between $q_1$ and $p_1$. The Return Lemma ensures that there is an $n$ such that $f_\mu^{n}(x) \in [p_0, p_1)$. Let $I_{n,j}$ be the component of $I_n$ that contains $x$. There are two cases to consider: either $I_{n,j} \subset [q_1, p_1)$, or it is not.

Suppose $I_{n,j} \subset [q_1, p_1)$. Since $f_\mu^{n}$ maps $I_{n,j}$ monotonically onto $[0, 1]$ (see [1, p. 36], [4, p. 71], or [6, p. 31]), we can partition $I_{n,j}$ into three subintervals, $I_{n,j} = L_{n,j} \cup K_{n,j} \cup R_{n,j}$, where $f_\mu^{n}(L_{n,j}) = [0, p_0]$, $f_\mu^{n}(K_{n,j}) = (p_0, p_1)$, and $f_\mu^{n}(R_{n,j}) = [p_1, 1]$. Since $L_{n,j} \subset I_{n,j} \subset [q_1, p_1)$ and $R_{n,j} \subset I_{n,j} \subset [q_1, p_1)$, Lemma 10 ensures that $|f_\mu^{n}(L_{n,j})| > |L_{n,j}|$ and $|f_\mu^{n}(R_{n,j})| > |R_{n,j}|$. That is, $f_\mu^{n}$ must do some stretching near both ends of $I_{n,j}$. By the Mean Value Theorem applied to $f_\mu^{n}$, there is a point $y \in L_{n,j}$ and a point $z \in R_{n,j}$ such that $|(f_\mu^{n})'(y)| > 1$ and $|(f_\mu^{n})'(z)| > 1$. Since $f_\mu^{n}(x) \in [p_0, p_1)$, we have $x \in \operatorname{closure}(K_{n,j})$, so $y < x < z$. Since $f_\mu^{n}$ does not have a critical point in $[y, z]$, the Minimum Principle ensures that $|(f_\mu^{n})'(x)| > 1$.

Now suppose that $I_{n,j}$ is not a subset of $[q_1, p_1)$. Once again, partition $I_{n,j}$ into three subintervals, $I_{n,j} = L_{n,j} \cup K_{n,j} \cup R_{n,j}$, where $f_\mu^{n}(L_{n,j}) = [0, p_0]$, $f_\mu^{n}(K_{n,j}) = (p_0, p_1)$, and $f_\mu^{n}(R_{n,j}) = [p_1, 1]$. As before, $x \in \operatorname{closure}(K_{n,j})$ because $f_\mu^{n}(x) \in [p_0, p_1)$. Since $x \in (q_1, p_1)$, one of $L_{n,j}$ or $R_{n,j}$ is contained in $[q_1, p_1)$, but, since $I_{n,j}$ is not a subset of $[q_1, p_1)$, the other one of $L_{n,j}$ or $R_{n,j}$ is not contained in $[q_1, p_1)$. Suppose that $L_{n,j}$ is contained in $[q_1, p_1)$, and $R_{n,j}$ is not (the other case is similar). Since $I_{n,j} \subset [q_1, 1]$ and $I_{n,j} \cap [q_1, p_1) \ne \emptyset$, it must be that $p_1 \in I_{n,j}$. As before, $|f_\mu^{n}(L_{n,j})| > |L_{n,j}|$, so the Mean Value Theorem ensures that there is a point $y \in L_{n,j}$ such that $|(f_\mu^{n})'(y)| > 1$. And $|(f_\mu^{n})'(p_1)| > 1$ since $p_1$ is a hyperbolic repelling fixed point. Then $x \in [y, p_1]$ and $f_\mu^{n}$ does not have a critical point in $[y, p_1]$, so the Minimum Principle ensures that $|(f_\mu^{n})'(x)| > 1$. ∎

Remark. This proof of the hyperbolicity of $\Lambda_\mu$ is adapted from the idea of an “induced map” or “first return map” for $f_\mu$; see [2, p. 341], and see [1, pp. 75-78] for an application of this idea when $\mu = 4$.




Theorem 12. If $\mu > 4$, then $\Lambda_\mu$ is a Cantor set.

Proof: The proof is now as easy as the case $\mu > 2 + \sqrt{5}$. Just observe that since $c_n \in \Lambda_\mu$, the hyperbolicity of $\Lambda_\mu$ ensures that $|(f_\mu^{n})'(c_n)| \ge C\lambda^{n}$. The rest of the proof is unchanged. ∎


REFERENCES 


1. R. Devaney, An introduction to chaotic dynamical systems, 2nd ed., Addison-Wesley, Redwood City, 
CA, 1989. 

2. J. Guckenheimer and P. Holmes, Nonlinear oscillations, dynamical systems, and bifurcations of vector 
fields, Applied Mathematical Sciences, 42, Springer-Verlag, New York-Berlin, 1983. 

3. D. Gulick, Encounters with chaos, McGraw-Hill, New York, 1992. 

4. R. Holmgren, A First Course in Discrete Dynamical Systems, 2nd ed., Springer-Verlag, New York, 
1996. 

5. W. de Melo and S. van Strien, One-dimensional dynamics, Ergebnisse der Mathematik und ihrer 
Grenzgebiete (3), Springer-Verlag, Berlin, 1993. 

6. C. Robinson, Stability, symbolic dynamics, and chaos, Studies in Advanced Mathematics, CRC 
Press, Boca Raton, FL, 1995. 


ROGER KRAFT received a Ph.D. from Northwestern University in 1990 under Clark Robinson. He was 
a visiting assistant professor at the University of Cincinnati, Case Western Reserve University, and 
Pomona College, and a postdoc at the M.S.R.I. He is now an associate professor at Purdue University 
Calumet. His field of research is dynamical systems, with special emphasis on two chaotic little 
attractors named Peter and Ziyad. 

Department of Mathematics, Computer Science, and Statistics, Purdue University Calumet, Hammond, IN 
46323 

roger@calumet.purdue.edu 


From the MONTHLY 100 Years Ago 


The following are some of the advanced courses of Mathematics 
offered for the year 1899-1900 at the University of Chicago: 
Twisted Curves and Surfaces, Associate Professor Maschke; Projec- 
tive Geometry, Professor Moore; Theory of Invariants, Professor 
Bolza; Continuous Groups, Professor Bolza; Theory of Functions of 


a Complex Variable, Professor Moore and Associate Professor 
Maschke; Elliptic Functions, Professor Bolza; Hyperelliptic Func- 
tions, Professor Bolza; Abstract Groups, Associate Professor 
Maschke; Elliptic Modular Functions, Professor Moore; Theory of 
Substitution, Professor Moore; Theory of Numbers, Assistant 
Professor Young, etc., etc. 


MONTHLY 6 (1899) 158 




An Elementary View of Euler’s 
Summation Formula 


Tom M. Apostol 


1. INTRODUCTION. The integral test for convergence of infinite series compares
a finite sum $\sum_{k=1}^{n} f(k)$ and an integral $\int_1^n f(x)\,dx$, where $f$ is positive and strictly
decreasing. The difference between a sum and an integral can be represented 
geometrically, as indicated in Figure 1. In 1736, Euler [3] used a diagram like this 
to obtain the simplest case of what came to be known as Euler’s summation 
formula, a powerful tool for estimating sums by integrals, and also for evaluating 
integrals in terms of sums. Later Euler [4] derived a more general version by an 
analytic method that is very clearly described in [5, pp. 159-161]. Colin Maclaurin 
[9] discovered the formula independently and used it in his Treatise of Fluxions, 
published in 1742, and some authors refer to the result as the Euler-Maclaurin 
summation formula. The general formula (24) is widely used in numerical analysis, 
analytic number theory, and the theory of asymptotic expansions. It contains 
Bernoulli numbers and periodic Bernoulli functions and is ordinarily discussed in 
courses in advanced calculus or real and complex analysis. This note shows how 
the general formula can be discovered by an elementary method, beginning with 
the diagram in Figure 1. This approach also shows how Bernoulli numbers and 
Bernoulli functions arise naturally along the way. The author has used this 
treatment successfully with beginning calculus students acquainted with the inte- 
gral test. 


2. GENERALIZED EULER’S CONSTANT. Throughout this section we assume that $f$ is a positive and strictly decreasing function on $[1, \infty)$. We introduce a sequence $\{d_n\}$ of numbers that represent the sum of the areas of the shaded curvilinear pieces above the interval $[1, n]$ in Figure 1. That is, we define

$$d_n = \sum_{k=1}^{n-1} f(k) - \int_1^n f(x)\,dx, \qquad n = 2, 3, \ldots. \tag{1}$$

Figure 1. All the shaded regions above $[1, n]$ fit inside a rectangle of area $f(1)$.


1999] AN ELEMENTARY VIEW OF EULER’S SUMMATION FORMULA 409 


It is clear that $d_{n+1} > d_n$ and that all the shaded pieces can be translated to the left to occupy a portion of the rectangle of altitude $f(1)$ above the interval $[0, 1]$, as shown in Figure 1. Because $f$ is decreasing there is no overlapping of the translated shaded pieces. Comparison of areas gives us the inequalities $0 < d_n < d_{n+1} < f(1)$. Therefore $\{d_n\}$ is increasing and bounded above, so it has a finite limit $C(f) = \lim_{n \to \infty} d_n$. We refer to $C(f)$ as the generalized Euler’s constant associated with the function $f$. Geometrically, $C(f)$ represents the sum of the areas of all the curvilinear triangular pieces over the interval $[1, \infty)$. These pieces can be translated to fit inside the rectangle of area $f(1)$ shown in Figure 1 (without overlapping), so we have the inequalities $0 < C(f) < f(1)$. Moreover, $C(f) - d_n$ represents the sum of the areas of the triangular pieces over the interval $[n, \infty)$. These pieces can be translated to the left to occupy (without overlapping) a portion of the rectangle of height $f(n)$ above the interval $[n, n + 1]$. Comparing areas we find

$$0 < C(f) - d_n < f(n), \qquad n = 2, 3, \ldots. \tag{2}$$


From these inequalities we can easily deduce: 


Theorem 1. If $f$ is positive and strictly decreasing on $[1, \infty)$ there is a positive constant $C(f) < f(1)$ and a sequence $\{E_f(n)\}$, with $0 < E_f(n) < f(n)$, such that

$$\sum_{k=1}^{n} f(k) = \int_1^n f(x)\,dx + C(f) + E_f(n), \qquad n = 2, 3, \ldots. \tag{3}$$


Note. Eq. (3) tells us that the difference between the sum and the integral is equal to a constant (depending on $f$) plus a positive quantity $E_f(n)$ smaller than the last term in the sum. Hence, if $f(n)$ tends to 0 as $n \to \infty$, then $E_f(n)$ also tends to 0.

Proof: If we define $E_f(n) = f(n) + d_n - C(f)$, then (3) follows from the definition (1), and the inequality $0 < E_f(n) < f(n)$ follows from (2). ∎

If $f(n) \to 0$ as $n \to \infty$, then (3) implies

$$C(f) = \lim_{n \to \infty} \left( \sum_{k=1}^{n} f(k) - \int_1^n f(x)\,dx \right). \tag{4}$$


Example. When $f(x) = 1/x$, $C(f)$ is the classical Euler’s constant, often denoted by $C$ (or by $\gamma$), and (4) states that $C = \lim_{n \to \infty} \left( \sum_{k=1}^{n} 1/k - \log n \right)$. It is not known (to date) whether Euler’s constant is rational or irrational. Its numerical value, correct to 20 decimals, is $C = 0.57721566490153286060$. In this case, Theorem 1 says that

$$\sum_{k=1}^{n} \frac{1}{k} = \log n + C + E(n), \qquad\text{where } 0 < E(n) < \frac{1}{n}.$$
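The bound $0 < E(n) < 1/n$ for the harmonic sum is easy to observe in floating point. The sketch below is illustrative only: it takes the 16-digit value of Euler’s constant quoted above as given, and the names `harmonic` and `error_term` are hypothetical.

```python
import math

# Euler's constant rounded to double precision (value quoted in the text).
EULER_CONSTANT = 0.5772156649015329


def harmonic(n):
    """Partial sum 1 + 1/2 + ... + 1/n, accumulated with math.fsum."""
    return math.fsum(1.0 / k for k in range(1, n + 1))


def error_term(n):
    """E(n) = H_n - log n - C from Theorem 1 with f(x) = 1/x."""
    return harmonic(n) - math.log(n) - EULER_CONSTANT


if __name__ == "__main__":
    for n in (2, 10, 100, 1000):
        print(n, error_term(n), 1.0 / n)
```

In each printed row the middle column lies strictly between 0 and the right-hand column, and it is close to $1/(2n)$, which anticipates the leading term of the error expansion derived later in the note.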


3. VARIOUS FORMS OF EULER’S SUMMATION FORMULA. In this section we 
no longer assume that f is positive or decreasing. At the outset we require only 
that the integral $\int_1^n f(x)\,dx$ exists for each integer $n \ge 2$. The key insight is to notice that the difference $d_n$ in (1) can be written as

$$d_n = \sum_{k=1}^{n-1} I(k), \tag{5}$$

where

$$I(k) = \int_k^{k+1} [f(k) - f(x)]\,dx. \tag{6}$$


When $f$ is positive and decreasing, as in Figure 2, $I(k)$ is the area of the shaded
curvilinear triangular piece over the interval [k, k + 1]. However, (5) and (6) are 
meaningful for any integrable f. 


Figure 2. Geometric interpretation of the integral $I(k)$ as the area of the shaded region above the interval $[k, k + 1]$.


The integrand in (6) has the form $u\,dv$, where $u = f(k) - f(x)$ and $v = x + c$, where $c$ is any constant. If we choose $c = -(k + 1)$ and integrate by parts (assuming that $f$ has a continuous derivative), the integrated part vanishes and the integral $I(k)$ reduces to

$$I(k) = \int_k^{k+1} (x - k - 1) f'(x)\,dx.$$

In this integral the dummy symbol $x$ varies from $k$ to $k + 1$, so the quantity $k$ in the integrand can be replaced by $[x]$, the greatest integer $\le x$. Make this replacement and substitute in (5) to find

$$
\begin{aligned}
d_n = \sum_{k=1}^{n-1} I(k) &= \int_1^n (x - [x] - 1) f'(x)\,dx \\
&= \int_1^n (x - [x]) f'(x)\,dx - \int_1^n f'(x)\,dx \\
&= \int_1^n (x - [x]) f'(x)\,dx + f(1) - f(n).
\end{aligned}
$$


Now use the definition of $d_n$ in (1) and rearrange terms to obtain:

Theorem 2. (First-derivative form of Euler’s summation formula). For any function $f$ with a continuous derivative on the interval $[1, n]$ we have

$$\sum_{k=1}^{n} f(k) = \int_1^n f(x)\,dx + \int_1^n (x - [x]) f'(x)\,dx + f(1). \tag{7}$$


The last two terms on the right represent the error made when the sum on the 
left is approximated by the integral $\int_1^n f(x)\,dx$. The formula is useful because $f$
need not be positive or decreasing. In fact, f can be increasing or oscillating. 
Variants of this formula will be obtained as we attempt to deduce more precise 
information about the error. 
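For polynomial $f$ the first-derivative form can be verified with exact rational arithmetic, since on each interval $[k, k+1)$ the factor $x - [x]$ equals $x - k$. The sketch below is a hedged illustration of that check, not the author’s method; polynomials are coefficient lists and every function name is hypothetical.

```python
from fractions import Fraction


def poly_eval(p, x):
    """Evaluate p[0] + p[1]*x + ... by Horner's rule."""
    result = Fraction(0)
    for c in reversed(p):
        result = result * x + c
    return result


def poly_diff(p):
    """Coefficient list of the derivative (the zero polynomial is [0])."""
    return [k * c for k, c in enumerate(p)][1:] or [Fraction(0)]


def poly_mul(p, q):
    """Coefficient list of the product p*q."""
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out


def poly_integral(p, a, b):
    """Exact integral of the polynomial p over [a, b]."""
    antiderivative = [Fraction(0)] + [Fraction(c) / (k + 1) for k, c in enumerate(p)]
    return poly_eval(antiderivative, Fraction(b)) - poly_eval(antiderivative, Fraction(a))


def euler_first_derivative_form(p, n):
    """Right-hand side: int_1^n f + int_1^n (x - [x]) f'(x) dx + f(1)."""
    dp = poly_diff(p)
    sawtooth_term = Fraction(0)
    for k in range(1, n):
        # On [k, k+1) the factor x - [x] is the polynomial x - k.
        sawtooth_term += poly_integral(poly_mul([Fraction(-k), Fraction(1)], dp), k, k + 1)
    return poly_integral(p, 1, n) + sawtooth_term + poly_eval(p, Fraction(1))


def direct_sum(p, n):
    """Left-hand side: f(1) + f(2) + ... + f(n)."""
    return sum(poly_eval(p, Fraction(k)) for k in range(1, n + 1))


if __name__ == "__main__":
    squares = [0, 0, 1]                      # f(x) = x**2
    print(direct_sum(squares, 3), euler_first_derivative_form(squares, 3))
```

The example uses an increasing function, which illustrates the remark that $f$ need not be positive or decreasing for the formula to hold.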

The factor $x - [x]$ is a nonnegative function with period 1. If $f'$ has a fixed sign (as it has when $f$ is monotonic), the integral term in the error has the same sign as $f'$. To decrease the error it is preferable to multiply $f'(x)$ by a factor that changes sign so that some cancellation takes place in the integration. To introduce sign changes, we translate the function $x - [x]$ down by $\frac{1}{2}$ and consider the new function $x - [x] - \frac{1}{2}$ whose graph is shown in Figure 3.

Figure 3. The periodic function $x - [x] - \frac{1}{2}$ changes sign.

The integral term in the error can now be written as

$$\int_1^n (x - [x]) f'(x)\,dx = \int_1^n \left(x - [x] - \tfrac{1}{2}\right) f'(x)\,dx + \frac{1}{2}\int_1^n f'(x)\,dx.$$

The last term is equal to $\frac{1}{2}\{f(n) - f(1)\}$. Using this in (7) we obtain the following variant of the first-derivative form of Euler’s summation formula:

$$\sum_{k=1}^{n} f(k) = \int_1^n f(x)\,dx + \int_1^n \left(x - [x] - \tfrac{1}{2}\right) f'(x)\,dx + \frac{1}{2}\{f(n) + f(1)\}. \tag{8}$$


Further variations will be obtained by repeated integration by parts in the second integral on the right of (8).

The factor $x - [x] - \frac{1}{2}$ has the value $-\frac{1}{2}$ when $x$ is an integer. We modify this factor slightly to make it vanish at the integers, a property that is desirable when we integrate by parts. To do this we introduce $P_1(x)$, the first Bernoulli function:

$$P_1(x) = \begin{cases} x - [x] - \frac{1}{2} & \text{if } x \ne \text{integer}, \\ 0 & \text{if } x = \text{integer}. \end{cases} \tag{9}$$


The error integral does not change if the factor $x - [x] - \frac{1}{2}$ is replaced by $P_1(x)$ because the two factors differ only at the integers. Therefore (8) can be written as

$$\sum_{k=1}^{n} f(k) = \int_1^n f(x)\,dx + \int_1^n P_1(x) f'(x)\,dx + \frac{1}{2}\{f(n) + f(1)\}. \tag{10}$$

Note the contrast between (10) and (3), which explicitly displays the generalized Euler’s constant $C(f)$. To make (10) resemble (3) more closely, we assume that the improper integral $\int_1^\infty P_1(x) f'(x)\,dx$ converges. Then we can write

$$\int_1^n P_1(x) f'(x)\,dx = \int_1^\infty P_1(x) f'(x)\,dx - \int_n^\infty P_1(x) f'(x)\,dx$$

and (10) takes the form

$$\sum_{k=1}^{n} f(k) = \int_1^n f(x)\,dx + C(f) + E_f(n), \tag{11}$$

where

$$C(f) = \frac{1}{2} f(1) + \int_1^\infty P_1(x) f'(x)\,dx \tag{12}$$

and

$$E_f(n) = \frac{1}{2} f(n) - \int_n^\infty P_1(x) f'(x)\,dx.$$


Eq. (11) has exactly the same form as (3), but (11) is more general because $f$ is not required to be positive or monotonic. The only restrictions on $f$ are continuity of $f'$ and convergence of the improper integral

$$\int_1^\infty P_1(x) f'(x)\,dx. \tag{13}$$

The improper integral in (13) converges if, and only if,

$$\lim_{n \to \infty} \int_n^\infty P_1(x) f'(x)\,dx = 0. \tag{14}$$

A sufficient condition for convergence is that $\int_1^\infty |f'(x)|\,dx$ converges, or equivalently, that

$$\lim_{n \to \infty} \int_n^\infty |f'(x)|\,dx = 0. \tag{15}$$

To see this, note that the Bernoulli function $P_1(x)$ is bounded; in fact, Figure 3 shows that $|P_1(x)| \le \frac{1}{2}$ for all $x$, so (14) follows from (15).


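Example. When $f(x) = 1/x$ we have $f'(x) = -1/x^{2}$ and

$$\int_n^\infty |f'(x)|\,dx = \int_n^\infty \frac{dx}{x^{2}} = \frac{1}{n}.$$

Therefore (15) is satisfied and (12) expresses Euler’s constant as an integral:

$$C = \frac{1}{2} - \int_1^\infty \frac{P_1(x)}{x^{2}}\,dx.$$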


4. FURTHER ANALYSIS OF THE ERROR TERM. Alternate forms of both the error term and the formula for the generalized Euler’s constant can be obtained by repeated integration by parts. First we introduce a new function $P_2(x)$ whose derivative is $2P_1(x)$ at all noninteger values of $x$. The factor 2 is used so that $P_2(x)$ is the second Bernoulli periodic function that appears in Euler’s summation formula. Therefore we require that

$$P_2(x) = 2\int_0^x P_1(t)\,dt + c, \tag{16}$$

where $c$ is a constant to be specified later. The function $P_2$ is quadratic on the interval $[0, 1]$. In fact, $P_2(x) = x^{2} - x + c$ if $0 \le x \le 1$. Its graph is a parabolic arc joining the points $(0, c)$ and $(1, c)$. Outside this interval the graph (shown in Figure 4) consists of horizontal translations of this parabolic arc because $P_2$ has period 1. To see this, we use the fact that $P_1$ has period 1 and that $\int_0^1 P_1(t)\,dt = 0$, which implies that $\int_a^b P_1(t)\,dt = 0$ for any interval $[a, b]$ of length 1. Therefore

$$P_2(x + 1) - P_2(x) = 2\int_x^{x+1} P_1(t)\,dt = 0.$$

Figure 4. Graph of $P_2(x) = 2\int_0^x P_1(t)\,dt + c$.


Because of periodicity, $P_2$ has the constant value $c = P_2(0)$ at the integers. Integration by parts shows that the integral in (10) is

$$\int_1^n P_1(x) f'(x)\,dx = \frac{1}{2} P_2(0)\{f'(n) - f'(1)\} - \frac{1}{2}\int_1^n P_2(x) f''(x)\,dx,$$

provided that $f''$ is continuous. Repeated integration by parts leads to the general form of Euler’s summation formula, which involves higher order derivatives of $f$ and higher order periodic Bernoulli functions that represent polynomials on the unit interval $[0, 1]$. To see exactly how the Bernoulli functions evolve in the process we follow the method of the foregoing section and integrate the periodic function $3P_2(t)$ from 0 to $x$ to obtain another periodic function $P_3(x)$ whose derivative is $3P_2(x)$. To guarantee that the integrated function $P_3(x)$ is periodic with period 1 we need $\int_0^1 P_2(t)\,dt = 0$. This property governs the choice of the constant $c$ in (16). The integral of the quadratic polynomial $x^{2} - x + c$ from 0 to 1 is equal to $c - \frac{1}{6}$, so we choose $c = \frac{1}{6}$ and take

$$P_2(x) = 2\int_0^x P_1(t)\,dt + \frac{1}{6}.$$

Euler’s summation formula can now be restated as follows:


Theorem 3. (Second-derivative form of Euler’s summation formula). For any function $f$ with a continuous second derivative on the interval $[1, n]$ we have

$$\sum_{k=1}^{n} f(k) = \int_1^n f(x)\,dx - \frac{1}{2}\int_1^n P_2(x) f''(x)\,dx + \frac{1}{2} P_2(0)\{f'(n) - f'(1)\} + \frac{1}{2}\{f(n) + f(1)\}. \tag{17}$$

Moreover, if the improper integral $\int_1^\infty |f''(x)|\,dx$ converges then we also have

$$\sum_{k=1}^{n} f(k) = \int_1^n f(x)\,dx + C(f) + E_f(n),$$

where

$$C(f) = \frac{1}{2} f(1) - \frac{1}{2} P_2(0) f'(1) - \frac{1}{2}\int_1^\infty P_2(x) f''(x)\,dx, \tag{18}$$

and

$$E_f(n) = \frac{1}{2} f(n) + \frac{1}{2} P_2(0) f'(n) + \frac{1}{2}\int_n^\infty P_2(x) f''(x)\,dx. \tag{19}$$


Figure 5. Graph of the periodic Bernoulli function $P_3(x) = 3\int_0^x P_2(t)\,dt$.


To improve the error estimate we integrate $P_2(t)$ from 0 to $x$ and define the Bernoulli function $P_3(x) = 3\int_0^x P_2(t)\,dt$ so that $P_3'(x) = 3P_2(x)$. There is no need to add a constant in this case because, on the unit interval $[0, 1]$, $P_3(x) = x^{3} - \frac{3}{2}x^{2} + \frac{1}{2}x$, and $\int_0^1 P_3(t)\,dt = 0$. The function $P_3$ has period 1 because $P_2$ has period 1 and $\int_0^1 P_2(t)\,dt = 0$. The graph of $P_3$ is a bounded piecewise cubic curve, shown in Figure 5. Note that $P_3(x)$ vanishes at the integers. Integration by parts over $[1, n]$ gives us

$$\int_1^n P_2(x) f''(x)\,dx = -\frac{1}{3}\int_1^n P_3(x) f^{(3)}(x)\,dx,$$

provided $f^{(3)}$ is continuous. This equation, together with Theorem 3, gives a third-derivative form of Euler’s summation formula in which the second integral term on the right of (17) is replaced by $\frac{1}{6}\int_1^n P_3(x) f^{(3)}(x)\,dx$. The corresponding changes in (18) and (19) are replacement of the integral terms by $\frac{1}{6}\int_1^\infty P_3(x) f^{(3)}(x)\,dx$ and $-\frac{1}{6}\int_n^\infty P_3(x) f^{(3)}(x)\,dx$, respectively.


5. BERNOULLI NUMBERS AND THE GENERAL FORM OF EULER’S SUMMATION FORMULA. The strategy for obtaining a general version of Euler’s summation formula is now evident. Starting with the Bernoulli periodic function $P_1(x)$ in (9) we introduce, in succession, periodic functions $P_2(x), P_3(x), \ldots$, with period 1, and a sequence of constants $B_k$ such that

$$P_k(x) = k\int_0^x P_{k-1}(t)\,dt + B_k \qquad\text{for } k \ge 2, \tag{20}$$

where each $B_k$ is chosen so that

$$\int_0^1 P_k(t)\,dt = 0. \tag{21}$$


Periodicity implies that $P_k(0) = P_k(1)$, and (21) shows that each of these values is $B_k$. As already noted, on the closed interval $[0, 1]$ each function $P_k(x)$ is a polynomial of degree $k$ when $k = 2$ or 3. [The case $k = 1$ is special; $P_1(x)$ is a linear polynomial $x - \frac{1}{2}$ only on the open interval $(0, 1)$ and is discontinuous at the endpoints.] It is clear (and easily proved by induction) that on the closed interval $[0, 1]$ the function defined by (20) is a polynomial of degree $k$ if $k \ge 2$. We denote this polynomial by $B_k(x)$, the usual notation for Bernoulli polynomials. The first few are

$$B_1(x) = x - \tfrac{1}{2}, \qquad B_2(x) = x^{2} - x + \tfrac{1}{6}, \qquad B_3(x) = x^{3} - \tfrac{3}{2}x^{2} + \tfrac{1}{2}x,$$

$$B_4(x) = x^{4} - 2x^{3} + x^{2} - \tfrac{1}{30}, \qquad B_5(x) = x^{5} - \tfrac{5}{2}x^{4} + \tfrac{5}{3}x^{3} - \tfrac{1}{6}x,$$

$$B_6(x) = x^{6} - 3x^{5} + \tfrac{5}{2}x^{4} - \tfrac{1}{2}x^{2} + \tfrac{1}{42}.$$

The Bernoulli periodic functions are periodic extensions of these polynomials given by $P_k(x) = B_k(x - [x])$. The constants $B_k = P_k(0) = P_k(1)$ are called Bernoulli numbers. The first few are

$$B_1 = -\tfrac{1}{2}, \quad B_2 = \tfrac{1}{6}, \quad B_3 = 0, \quad B_4 = -\tfrac{1}{30}, \quad B_5 = 0, \quad B_6 = \tfrac{1}{42}.$$

Next we show that our definitions of Bernoulli numbers and polynomials are consistent with the usual definitions, provided we take $B_0(x) = 1$ and $B_0 = 1$. Our definition in (20) shows that the successive derivatives of these polynomials are

$$B_k'(x) = kB_{k-1}(x), \quad B_k''(x) = k(k-1)B_{k-2}(x), \quad \ldots, \quad B_k^{(r)}(x) = r!\binom{k}{r}B_{k-r}(x),$$

and hence

$$B_k^{(r)}(0) = r!\binom{k}{r}B_{k-r}(0) = r!\binom{k}{r}B_{k-r}. \tag{22}$$

On the other hand, the Taylor expansion of any polynomial $B_k(x)$ of degree $k$ is given by $B_k(x) = \sum_{r=0}^{k} B_k^{(r)}(0)\,x^{r}/r!$, so (22) implies

$$B_k(x) = \sum_{r=0}^{k} \binom{k}{r} B_{k-r}\,x^{r}. \tag{23}$$

Taking $x = 1$ in (23) and noting that $B_k(1) = P_k(1) = B_k$ for $k \ge 2$, we find that (23) becomes

$$B_k = \sum_{r=0}^{k} \binom{k}{r} B_{k-r} \qquad\text{for } k \ge 2.$$

This is the usual recursion formula for defining Bernoulli numbers (starting with $B_0 = 1$), and (23) is one of the standard ways of defining Bernoulli polynomials in terms of Bernoulli numbers. Consequently, the numbers and polynomials that appear in our treatment are the usual Bernoulli numbers and Bernoulli polynomials that appear in the literature; see [1, p. 265], [2, p. 251], or [5, pp. 160-163].
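The recursion and the expansion (23) translate directly into a short exact computation. In the sketch below (an illustration with hypothetical function names), the $r = 0$ term $B_k$ cancels from both sides of the recursion, which leaves a linear equation for $B_{k-1}$ in terms of the earlier numbers.

```python
from fractions import Fraction
from math import comb


def bernoulli_numbers(n_max):
    """Return [B_0, B_1, ..., B_n_max] from the recursion, starting with B_0 = 1."""
    B = [Fraction(1)]
    for k in range(2, n_max + 2):
        # B_k = sum_{r=0}^{k} C(k, r) B_{k-r}; cancelling B_k leaves
        # sum_{j=0}^{k-1} C(k, j) B_j = 0, which is solved for B_{k-1}.
        partial = sum(comb(k, j) * B[j] for j in range(k - 1))
        B.append(-partial / k)
    return B[: n_max + 1]


def bernoulli_poly(k, x):
    """Evaluate B_k(x) = sum_{r=0}^{k} C(k, r) B_{k-r} x**r, as in (23)."""
    B = bernoulli_numbers(k)
    return sum(comb(k, r) * B[k - r] * Fraction(x) ** r for r in range(k + 1))


if __name__ == "__main__":
    print(bernoulli_numbers(6))
    print(bernoulli_poly(2, Fraction(1, 2)))
```

Working with `Fraction` keeps every value exact, so the output can be compared digit for digit with the table of Bernoulli numbers given above.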

It is well known that the Bernoulli numbers $B_k$ with odd index $k \ge 3$ are zero, so only Bernoulli numbers with even index appear in the general form of Euler’s summation formula. It is also known [8, p. 533] that on the interval $[0, 1]$ the Bernoulli polynomials satisfy the following inequalities for $k \ge 1$:

$$|B_{2k}(x)| \le |B_{2k}| \quad\text{and}\quad |B_{2k+1}(x)| \le (2k + 1)|B_{2k}|.$$


The method we have outlined leads to the following odd-order derivative 
version of Euler’s summation formula. A proof is easily given by induction on the 
order 2m + 1. 


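Theorem 4. (General form of Euler’s summation formula). For any function $f$ with a continuous derivative of order $2m + 1$ on the interval $[1, n]$ we have

$$\sum_{k=1}^{n} f(k) = \int_1^n f(x)\,dx + \frac{1}{(2m+1)!}\int_1^n P_{2m+1}(x) f^{(2m+1)}(x)\,dx + \sum_{r=1}^{m} \frac{B_{2r}}{(2r)!}\{f^{(2r-1)}(n) - f^{(2r-1)}(1)\} + \frac{1}{2}\{f(n) + f(1)\}. \tag{24}$$

Moreover, if the improper integral $\int_1^\infty |f^{(2m+1)}(x)|\,dx$ converges then we also have

$$\sum_{k=1}^{n} f(k) = \int_1^n f(x)\,dx + C(f) + E_f(n), \tag{25}$$

where

$$C(f) = \frac{1}{2} f(1) - \sum_{r=1}^{m} \frac{B_{2r}}{(2r)!} f^{(2r-1)}(1) + \frac{1}{(2m+1)!}\int_1^\infty P_{2m+1}(x) f^{(2m+1)}(x)\,dx \tag{26}$$

and

$$E_f(n) = \frac{1}{2} f(n) + \sum_{r=1}^{m} \frac{B_{2r}}{(2r)!} f^{(2r-1)}(n) - \frac{1}{(2m+1)!}\int_n^\infty P_{2m+1}(x) f^{(2m+1)}(x)\,dx. \tag{27}$$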


Example. When $f(x) = 1/x$ we have $f^{(2m+1)}(x) = -(2m + 1)!/x^{2m+2}$, and (26) gives the following expression for the classical Euler’s constant:

$$C = \frac{1}{2} + \frac{B_2}{2} + \frac{B_4}{4} + \cdots + \frac{B_{2m}}{2m} - \int_1^\infty \frac{P_{2m+1}(x)}{x^{2m+2}}\,dx. \tag{28}$$

The corresponding error term (27) becomes

$$E(n) = \frac{1}{2n} - \frac{B_2}{2n^{2}} - \frac{B_4}{4n^{4}} - \cdots - \frac{B_{2m}}{2m\,n^{2m}} + \int_n^\infty \frac{P_{2m+1}(x)}{x^{2m+2}}\,dx. \tag{29}$$

One is tempted to let $m \to \infty$ in (28) and obtain an infinite series for Euler’s constant. However, the integral in (28) does not tend to 0 as $m \to \infty$ and, in fact, it can be shown that the infinite series $\sum B_{2k}/(2k)$ diverges rapidly [see 6, p. 529], so (28) is not very useful for calculating $C$. Nevertheless, as we show in the next section, (25) and (27) can be used to calculate $C$ very accurately.


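6. CALCULATION OF EULER’S CONSTANT. We use Euler’s summation formula to calculate the first 7 digits in Euler’s constant. Take $f(x) = 1/x$ in (25) and rewrite it as

$$C = \sum_{k=1}^{n} \frac{1}{k} - \log n - E(n), \tag{30}$$

where $E(n)$ is given by (29). Taking $m = 3$ in (29) we find

$$
\begin{aligned}
E(n) &= \frac{1}{2n} - \frac{B_2}{2n^{2}} - \frac{B_4}{4n^{4}} - \frac{B_6}{6n^{6}} + \int_n^\infty \frac{P_7(x)}{x^{8}}\,dx \\
&= \frac{1}{2n} - \frac{1}{12n^{2}} + \frac{1}{120n^{4}} - \frac{1}{252n^{6}} + \int_n^\infty \frac{P_7(x)}{x^{8}}\,dx.
\end{aligned}
$$

Using the inequality $|P_7(x)| \le 7|B_6| = \frac{1}{6}$, we get

$$\left| \int_n^\infty \frac{P_7(x)}{x^{8}}\,dx \right| \le \frac{1}{6}\int_n^\infty \frac{dx}{x^{8}} = \frac{1}{42n^{7}},$$

and (30) can be written in the form

$$C = \sum_{k=1}^{n} \frac{1}{k} - \log n - \frac{1}{2n} + \frac{1}{12n^{2}} - \frac{1}{120n^{4}} + \frac{1}{252n^{6}} + E(n), \tag{31}$$

where $0 < |E(n)| < 1/(42n^{7})$. Using a hand calculator that displays 12 digits we find $\sum_{k=1}^{10} k^{-1} = 2.92896825381$ and $\log 10 = 2.30258509299$. If $n = 10$ the sum of the error term $E(n)$ plus the term with $252n^{6}$ in the denominator in (31) is too small to influence the seventh digit. Neglecting these terms and retaining 8 digits in the calculation we find

$$
\begin{aligned}
C &\approx 2.92896825 - 2.30258509 - \frac{1}{20} + \frac{1}{1200} - \frac{1}{1200000} \\
&= 0.62638316 - 0.05000000 + 0.00083333 - 0.00000083 \\
&= 0.57721566.
\end{aligned}
$$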

This calculation, using m= 3 and n = 10 in (29) and (30), which guarantees 
7 decimal places, actually gives the first 8 correct digits of C. Knuth [7] used (29) 
and (30) with m = 250 and n = 10,000 to calculate the value of C to 1,271 
decimal places. 
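The hand calculation with $m = 3$ and $n = 10$ can be repeated in double precision. The following sketch (function name hypothetical) evaluates the explicit terms of the $m = 3$ expansion and drops only the remainder integral, whose size is bounded by $1/(42n^{7})$.

```python
import math


def euler_constant_estimate(n=10):
    """Explicit terms of the m = 3 expansion; the remainder integral is dropped."""
    harmonic = math.fsum(1.0 / k for k in range(1, n + 1))
    return (harmonic - math.log(n)
            - 1.0 / (2 * n)
            + 1.0 / (12 * n ** 2)
            - 1.0 / (120 * n ** 4)
            + 1.0 / (252 * n ** 6))


if __name__ == "__main__":
    estimate = euler_constant_estimate(10)
    bound = 1.0 / (42 * 10 ** 7)
    print(estimate, bound)
```

With $n = 10$ the printed bound is roughly $2.4 \times 10^{-9}$, so the estimate should agree with $0.5772156649\ldots$ to at least eight decimal places, in line with the remark that the calculation gives more digits than the crude bound guarantees.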

This note outlines only one application of Euler’s summation formula. Others 
can be found in Knopp’s treatise [6]. One of them uses the increasing function 
f(x) = log x to derive Stirling’s asymptotic formula for the logarithm of n!. Euler’s 
summation formula and its relation to Bernoulli numbers and polynomials pro- 
vides a treasure trove of interesting enrichment material suitable for elementary 
calculus courses.


REFERENCES 


1. Tom M. Apostol, Introduction to Analytic Number Theory. Springer-Verlag, New York, 1976. 

2. Tom M. Apostol, Mathematical Analysis, Second edition. Addison-Wesley, Reading, Mass., 1974. 

3. L. Euler, Methodus universalis serierum convergentium summas quam proxime inveniendi, Commen- 
tarii academie scientiarum Petropolitanae, Vol. 8 (1736), pp. 3-9; Opera Omnia, Vol. XIV, 
pp. 101-107. 

4. L. Euler, Methodus universalis series summandi ulterius promota, Commentarii academie scientiarum 
Petropolitanae, Vol. 8 (1736), pp. 147-158; Opera Omnia, Vol. XIV, pp. 124-137. 

5. E. Hairer and G. Wanner, Analysis by Its History. Springer-Verlag, New York, 1996. 

6. K. Knopp, Theory and Application of Infinite Series, R. C. Young, translator. Hafner, New York, 
1951. 

7. D.E. Knuth, Euler’s constant to 1271 places, Math. of Computation, v. 16 (1962), pp. 275-281. 

8. D. H. Lehmer, On the maxima and minima of Bernoulli polynomials, Amer. Math. Monthly, v. 47 
(1940), pp. 533-538. 

9. Colin Maclaurin, A Treatise of Fluxions. Edinburgh, 1742. 


TOM M. APOSTOL received his Ph.D. in 1948 with a thesis in analytic number theory written under 
the direction of D. H. Lehmer at UC Berkeley. He joined the Caltech faculty in 1950 and became 
professor emeritus in 1992. His list of publications contains 58 research papers and several books, 
including his pathbreaking Calculus in two volumes, first published in 1961, Mathematical Analysis 
(1957), and Introduction to Analytic Number Theory (1976), all of which are still in print. He is director 
of Project MATHEMATICS!, a prize winning series of videos and other educational activities he 
initiated twelve years ago. His 50-year career in mathematics is described in an engaging article by 
Don Albers, An Interview with Tom Apostol, published in the September 1997 issue of The College 
Mathematics Journal. 

Project MATHEMATICS!, 1-70 Caltech, Pasadena, CA 91125 

apostol@caltech.edu 




Marriage, Magic, and Solitaire 


David B. Leep and Gerry Myerson 


1. SOLITAIRE. Here’s a solitaire game you can always win. 

Deal out a deck of cards, face up, into a 4 × 13 array. The object of the game is
to select 13 cards, one from each column, in such a way as to get one card of each 
denomination. 

It turns out that it is always possible to make such a selection. The proof is a 
simple application of Hall’s Marriage Theorem, as we show in Example 1 in the 
next section. In Sections 3 and 4, we identify winning the solitaire game with 
decomposing a semi-magic square into a linear combination, with positive integer 
coefficients, of permutation matrices. The remainder of the paper discusses the 
number of permutation matrices needed to express a given semi-magic square. 


2. MARRIAGE. Suppose there are sets $A_1, A_2, \dots, A_n$, and you wish to know
whether there exist distinct objects $x_1, x_2, \dots, x_n$ such that $x_1$ is in $A_1$, $x_2$ is in
$A_2$, ..., and $x_n$ is in $A_n$; we’ll call this a transversal. If any $A_j$ is empty, then it’s
clear that $x_j$ does not exist; a simple necessary condition for the existence of a
transversal is that $\#A_j \ge 1$ for all $j$ (we write $\#S$ for the cardinality of the set $S$).

If among the sets $A_1, \dots, A_n$ there are two whose union has only one element,
then there can be no transversal. More generally, a necessary condition for the
existence of a transversal is that $\#\bigcup_{j \in J} A_j \ge \#J$ for every index set $J \subseteq \{1, \dots, n\}$.

Hall’s Marriage Theorem states that this simple necessary condition is also 
sufficient: 


Theorem 1. There exist distinct $x_1, \dots, x_n$ such that $x_j \in A_j$ for all $j$ if and only if
$\#\bigcup_{j \in J} A_j \ge \#J$ for all $J \subseteq \{1, \dots, n\}$.


Many proofs are known, and the reader with access to combinatorics and/or 
graph theory textbooks will have little difficulty finding one, so we do not present 
one here. The compilation [2] contains Hall’s original proof, and the spiffy proof of 
Halmos and Vaughan. The interpretation wherein the “objects” are men and $A_j$ is
the set of suitable marriage partners for the jth woman is the origin of the name,
“Marriage Theorem.”

The application to the solitaire game is as follows. 


Example 1. Let the objects be the 13 denominations, and let $A_j$ be the set of all
denominations of cards in the jth column. For example, if column 7 has an ace, a
deuce, and two jacks, then $A_7$ = {ace, deuce, jack}. Any collection of k columns,
$1 \le k \le 13$, contains 4k cards, hence contains cards of at least k different
denominations (since there are only 4 cards of each denomination). But this is
precisely the condition for Hall’s Theorem to apply, and it tells us we can choose a
different denomination from each column.
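Hall’s Theorem guarantees that a winning selection exists but does not say how to find one. The following is a minimal sketch, not taken from the paper, that finds one by the standard augmenting-path method for bipartite matching. The function names `deal` and `winning_selection` and the numbering of denominations 0 to 12 are our own assumptions for illustration.

```python
import random

DENOMINATIONS = list(range(13))  # 0 = ace, ..., 12 = king (labels are our choice)

def deal(seed=None):
    """Shuffle a 52-card deck (suits ignored) and deal 13 columns of 4 cards."""
    deck = [d for d in DENOMINATIONS for _ in range(4)]
    random.Random(seed).shuffle(deck)
    return [deck[4 * j: 4 * j + 4] for j in range(13)]

def winning_selection(columns):
    """Return one denomination per column, all distinct, or None if impossible."""
    holder = {}  # denomination -> index of the column currently assigned to it

    def assign(col, seen):
        # Try each denomination present in this column; if it is already taken,
        # try to move its current holder to some other denomination.
        for denom in sorted(set(columns[col])):
            if denom in seen:
                continue
            seen.add(denom)
            if denom not in holder or assign(holder[denom], seen):
                holder[denom] = col
                return True
        return False

    for col in range(len(columns)):
        if not assign(col, set()):
            return None
    pick = [None] * len(columns)
    for denom, col in holder.items():
        pick[col] = denom
    return pick
```

For any deal of a full deck, `winning_selection(deal())` should return a list of 13 distinct denominations, one taken from each column; it returns `None` only for inputs that violate Hall’s condition.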


1999] MARRIAGE, MAGIC, AND SOLITAIRE 419 


3. MAGIC. That could be the end of the discussion, but instead we approach the 
problem from a different point of view, in order to introduce the topic we really 
want to talk about: semi-magic squares. Much of what we have to say applies, 
mutatis mutandis, to doubly-stochastic matrices, so there should be something here 
to appeal to a variety of mathematical tastes. 

Having dealt out the cards, construct a 13 × 13 matrix A, as follows. Each
column in A corresponds to a column of cards, and each row to a denomination.
The value of $a_{ij}$ (the usual notation for the entry in row i, column j of A, although
we also write A(i, j)) is then taken to be the number of cards of denomination i in
column j. In Example 1, we would have $a_{\text{ace},7} = 1$, $a_{\text{jack},7} = 2$, and $a_{\text{queen},7} = 0$.

The matrix so constructed enjoys the following properties:

1. its entries are non-negative integers,

2. the entries in each row add up to 4 (because there are exactly 4 cards of each
denomination), and

3. the entries in each column also add up to 4 (because there are exactly 4 cards
in each column of cards).

Thus, the matrix is a semi-magic square: a square array of non-negative integers
having constant line-sums. “Line-sums” means both row and column sums. The
common value of the line-sums is called the magic constant of the semi-magic
square, and is denoted by m. In a magic square, the entries along each diagonal
also add up to m, but we do not invoke this condition in the sequel.

Hall’s Theorem has the following consequence:


Theorem 2. A non-zero semi-magic square has a transversal all of whose elements are 
non-zero. 


In this context, “transversal” means a set of entries meeting each line exactly
once (that is, one entry from each column, each from a different row). For, let the
columns correspond to sets, and the rows to objects, and let $a_{ij}$ non-zero mean
that object i is in set j. In any k columns, the non-zero entries add up to km.
Restricting our attention to those k columns, if fewer than k rows meet those
columns in non-zero entries, then at least one row meets those columns in entries
that add up to more than m; but this is impossible, since the entries in each row
add up to exactly m. Thus, Hall’s Theorem applies, and there is a choice of a
different object from each set: a non-zero entry from each column, each from a
different row.

In the 13 × 13 semi-magic square constructed in Example 1 from an array of
cards, a transversal corresponds to a selection of one card from each column, each
of a different denomination. Thus we have a second way to use Hall’s Theorem to
prove that we can always win this game of solitaire.


4. PERMUTATIONS. Perhaps the simplest non-zero semi-magic squares are those
with all line-sums 1, the permutation matrices. A permutation matrix is a matrix of
zeros and ones, the ones forming a transversal. The name arises from the
association of each such matrix A to a permutation $\sigma$ via $a_{ij} = 1$ if and only if
$\sigma(i) = j$. This association is a group isomorphism from the multiplicative group of
n × n permutation matrices to the group of all permutations of {1, ..., n}.

We can reformulate Theorem 2: if A is a non-zero semi-magic square, then 
there is a permutation matrix P such that A — P has non-negative entries. But 
then A — P is itself clearly a semi-magic square, whence, by induction, we deduce 




Theorem 3. Every semi-magic square can be expressed as a sum of permutation 
matrices. 


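Theorems 2 and 3 are due to König [3]. As an illustration of Theorem 3, we
note that

$$
\begin{pmatrix} 8 & 1 & 6 \\ 3 & 5 & 7 \\ 4 & 9 & 2 \end{pmatrix}
= 7\begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix}
+ 4\begin{pmatrix} 0 & 0 & 1 \\ 0 & 1 & 0 \\ 1 & 0 & 0 \end{pmatrix}
+ 2\begin{pmatrix} 0 & 0 & 1 \\ 1 & 0 & 0 \\ 0 & 1 & 0 \end{pmatrix}
+ \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}
+ \begin{pmatrix} 0 & 1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 1 \end{pmatrix}.
$$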


A doubly-stochastic matrix is a matrix with non-negative real entries and all
line-sums equal to one. Dividing any non-zero semi-magic square by its magic
constant yields a doubly-stochastic matrix. Birkhoff [1] proved that every
doubly-stochastic matrix is a convex combination of permutation matrices; see also
[6, Theorem 5.4 of Chapter 5].

An expression of a semi-magic square as a sum of permutation matrices is, in 
general, not unique. We may ask for an expression that uses as few distinct 
permutation matrices as possible. The rest of this paper is an attempt to come to 
grips with this and related questions. 


5. THE BASIS. The concepts of permutation matrix and semi-magic square
generalize readily to square matrices with entries from any ring R with unit. Let the unit
element of R be 1. Then a permutation matrix over R is, as before, a matrix of
zeros and ones, the ones forming a transversal. A constant line-sum matrix over R
is a square array of elements of R having all line-sums equal. We reserve the term
“semi-magic square” for a constant line-sum matrix over the integers with
non-negative entries. Any linear combination of permutation matrices with coefficients
in R is a constant line-sum matrix over R. We have seen that any semi-magic
square with non-negative integer entries is an integer-linear combination of
permutation matrices, and we now show that this, too, generalizes to constant
line-sum matrices over R. The case n = 1 is trivial, and a 2 × 2 constant line-sum
matrix must look like

$$
\begin{pmatrix} a & b \\ b & a \end{pmatrix}
= a\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix}
+ b\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}.
$$


Thus, we may assume $n \ge 3$. Let $\mathcal{B}_n$ be the set of all permutation matrices
corresponding to those transpositions and 3-cycles that move 1, together with the
identity matrix. That is, $\mathcal{B}_n$ contains the permutations of the form (1j), $2 \le j \le n$,
and those of the form (1jk), $2 \le j \le n$, $2 \le k \le n$, $j \ne k$, and the identity. Then
$\mathcal{B}_n$ is a linearly independent set, over any ring whatsoever. For if $\sum_{P \in \mathcal{B}_n} a_P P = 0$,
then $a_{(1jk)}$ must be zero, since $P_{(1jk)}$ is the only matrix in $\mathcal{B}_n$ with a non-zero entry
in row j, column k. And if all the $a_{(1jk)}$ are zero, then $a_{(1j)}$ must be zero, since
each $P_{(1j)}$ has a one in the first row that none of the others has. Finally, $a_I$ must
be zero.

Now let A be any n × n constant line-sum matrix over R. Let
$B = A - \sum a_{jk} P_{(1jk)}$, taking the sum over all j and k distinct from each other and one;
then $b_{jk} = 0$ for all these j and k. Let $C = B - \sum_{j=1}^{n} b_{1j} P_{(1j)}$, where $P_{(11)}$
is understood to be the identity matrix. Then C has all
line-sums zero, since its first row is entirely zeros. Each column of C, other than
the first, has n − 1 zeros, hence, n zeros; then, looking across the rows, we see
that all the entries in the first column must be zero as well. Thus,
$A = \sum a_{jk} P_{(1jk)} + \sum b_{1j} P_{(1j)}$ expresses A as a linear combination of permutation matrices (with
coefficients in the ring generated by the entries of A).

Summing up, we have proved: 


Theorem 4. For any ring R with unit, the set of all R-linear combinations of elements
of $\mathcal{B}_n$ is the set of all n × n constant line-sum matrices with entries in R.
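The proof of Theorem 4 is constructive, and the construction can be sketched in a few lines. This is a minimal illustration under our own conventions: indices are 0-based (so the paper’s element 1 is index 0), a permutation is a tuple `p` whose matrix has a one in row `i`, column `p[i]`, and the names `decompose` and `recombine` are hypothetical helpers, not notation from the paper.

```python
def three_cycle(n, j, k):
    """The 3-cycle (1 j k) in 0-based form: 0 -> j -> k -> 0."""
    p = list(range(n))
    p[0], p[j], p[k] = j, k, 0
    return tuple(p)

def transposition(n, j):
    """The transposition (1 j) in 0-based form: 0 <-> j."""
    p = list(range(n))
    p[0], p[j] = j, 0
    return tuple(p)

def decompose(A):
    """Write a constant line-sum matrix A as {basis permutation: coefficient}."""
    n = len(A)
    B = [row[:] for row in A]
    coeffs = {}

    def subtract(p, c):
        if c != 0:
            coeffs[p] = c
            for i in range(n):
                B[i][p[i]] -= c

    # Step 1: clear every off-diagonal entry outside the first row and column.
    for j in range(1, n):
        for k in range(1, n):
            if j != k:
                subtract(three_cycle(n, j, k), A[j][k])
    # Step 2: clear the first row with transpositions, then with the identity.
    for j in range(1, n):
        subtract(transposition(n, j), B[0][j])
    subtract(tuple(range(n)), B[0][0])
    if any(x != 0 for row in B for x in row):
        raise ValueError("input is not a constant line-sum matrix")
    return coeffs

def recombine(coeffs, n):
    """Rebuild the matrix from its coefficients, as a check."""
    M = [[0] * n for _ in range(n)]
    for p, c in coeffs.items():
        for i in range(n):
            M[i][p[i]] += c
    return M
```

Applied to the 3 × 3 magic square with rows (8, 1, 6), (3, 5, 7), (4, 9, 2), the coefficients come out with mixed signs, which is expected: Theorem 4 gives a linear combination over the ring, not a non-negative one.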


A closer look at the proof leads to our next result. 


Theorem 5. Each n × n permutation matrix can be written as a ±1-combination of
at most 2n − 1 elements of $\mathcal{B}_n$ (meaning, a linear combination in which each
coefficient is 1 or −1).


Proof: Let A in the proof of Theorem 4 be a permutation matrix. For each j,
$2 \le j \le n$, there is at most one k, $k \ne 1$, $k \ne j$, such that $a_{jk} = 1$. Thus, at most
n − 1 of the coefficients $a_{jk}$ of the $P_{(1jk)}$ are one (the rest being zero), and no two of these
have the same value of j. So, the first row of $B = A - \sum a_{jk} P_{(1jk)}$ takes all n of
its entries from {−1, 0, 1}, and $b_{1j}$ is in {−1, 0, 1} for all j. ∎


That Theorems 4 and 5 proclaim a special property of $\mathcal{B}_n$ can be seen from the
following equation, valid for any $n \ge 4$:

$$2I = (12) + (23) + (34) + (41) - (1234) - (4321), \tag{1}$$

where we have adopted the notational convenience of replacing a permutation
matrix with the permutation it represents. It is easy to check that the six matrices
on the right are linearly independent over any ring R that does not have a
non-zero element x satisfying x + x = 0; but the identity matrix cannot be
expressed as a ±1-combination of any linearly independent set that includes these
six matrices, and it cannot be written as an R-linear combination at all, if R has no
element x satisfying x + x = 1 (for example, if R is the integers).

This example suggests a question, for which we do not know the answer:
given n, for which integers m does there exist an n × n semi-magic square A,
a linearly independent set of permutation matrices $\{P_1, \dots, P_r\}$, and non-negative
integers $c_1, \dots, c_r$ with $\gcd(c_1, \dots, c_r) = 1$, such that $mA = \sum_j c_j P_j$? Equation (1)
shows that we may take m = 2 for every $n \ge 4$; indeed, from

$$(m - 2)I = (12) + (23) + \cdots + (m{-}1\ m) + (m\,1) - (1\,2 \dots m) - (m\ m{-}1 \dots 1)$$

it is easy to verify that for any n we can take any m not exceeding n − 2.
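The displayed identity for (m − 2)I is easy to check by machine. The sketch below uses our own 0-based conventions and hypothetical helper names (`cycle`, `signed_sum`, `identity_holds`); it adds the m “adjacent” transpositions on the points 0, ..., m − 1 and subtracts the m-cycle and its inverse, inside n × n matrices with n ≥ m.

```python
def cycle(n, points):
    """Permutation of range(n), as a tuple, sending each listed point to the next."""
    p = list(range(n))
    for a, b in zip(points, points[1:] + points[:1]):
        p[a] = b
    return tuple(p)

def signed_sum(n, terms):
    """Sum of sign * (permutation matrix of p) over (sign, p) pairs."""
    M = [[0] * n for _ in range(n)]
    for sign, p in terms:
        for i in range(n):
            M[i][p[i]] += sign
    return M

def identity_holds(n, m):
    """Check (m-2)I = (12)+(23)+...+(m 1) - (12...m) - (m ... 21) in size n."""
    ring = tuple(range(m))
    terms = [(1, cycle(n, (a, (a + 1) % m))) for a in range(m)]
    terms.append((-1, cycle(n, ring)))
    terms.append((-1, cycle(n, ring[::-1])))
    expected = [[m - 2 if i == j else 0 for j in range(n)] for i in range(n)]
    return signed_sum(n, terms) == expected
```

For example, `identity_holds(4, 4)` checks equation (1) itself, and `identity_holds(n, m)` with 3 ≤ m ≤ n checks the general form.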


6. HOW MANY? (BIG FIELDS). It is easy to see that the set of all n × n
constant line-sum matrices over a field F forms a vector space over F. What is the
dimension of this space?


Theorem 6. The dimension of the vector space of all n × n constant line-sum matrices
over a field F is $n^2 - 2n + 2$.


This can be seen in several different ways.
1) Assign arbitrary values to $a_{ij}$, $1 \le i \le n-1$, $1 \le j \le n-1$, and also to $a_{1n}$,
making $(n-1)^2 + 1$ arbitrary choices in all. There is a unique choice of each $a_{in}$,
$2 \le i \le n-1$, and each $a_{nj}$, $1 \le j \le n-1$, that makes the corresponding row or
column sum equal to the sum of the entries in the first row, and then a unique
choice of $a_{nn}$ to complete the constant line-sum matrix.

2) To be a constant line-sum matrix is to satisfy 2n − 1 equations of the form,
“the entries in row 1 add up to the same number as the entries in a different line.”
There is one dependence relation among these equations, since the sum of all the
row sums equals the sum of all the column sums, so the vector space has
codimension 2n − 2 in the vector space of all n × n matrices, which means that
the dimension is $n^2 - (2n - 2)$.

3) The basis $\mathcal{B}_n$ has (n − 1)(n − 2) elements of the form (1jk), n − 1 of the
form (1j), and the identity, making $n^2 - 2n + 2$ in all. ∎

It follows from Theorems 4 and 6 that, over a field, any n × n constant line-sum
matrix can be expressed as a linear combination of $n^2 - 2n + 2$ or fewer
permutation matrices. It also follows that, over an infinite field (or, indeed, a sufficiently
large finite field), there exist constant line-sum matrices that cannot be expressed
as a linear combination of fewer than $n^2 - 2n + 2$ permutation matrices. This is
based on the observation that no vector space over an infinite field is the union of
finitely many proper subspaces, which is a corollary to a technical lemma that we
have relegated to the appendix.
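For small n the dimension count can be confirmed numerically. The sketch below, which is our own illustration rather than anything in the paper, flattens every n × n permutation matrix into a vector of length n², and computes the rank of the resulting family over the rationals by exact Gaussian elimination. The function names are hypothetical.

```python
from fractions import Fraction
from itertools import permutations

def rank(rows):
    """Rank over the rationals, by Gauss-Jordan elimination with exact fractions."""
    rows = [[Fraction(x) for x in r] for r in rows]
    rk = 0
    for c in range(len(rows[0])):
        pivot = next((i for i in range(rk, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[rk], rows[pivot] = rows[pivot], rows[rk]
        for i in range(len(rows)):
            if i != rk and rows[i][c] != 0:
                f = rows[i][c] / rows[rk][c]
                rows[i] = [x - f * y for x, y in zip(rows[i], rows[rk])]
        rk += 1
    return rk

def permutation_span_dimension(n):
    """Dimension of the rational span of all n x n permutation matrices."""
    vectors = [
        [1 if p[i] == j else 0 for i in range(n) for j in range(n)]
        for p in permutations(range(n))
    ]
    return rank(vectors)
```

By Theorems 4 and 6 the value returned for a given n should equal n² − 2n + 2; the check is only practical for small n, since there are n! permutation matrices.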


7. HOW MANY? (NON-NEGATIVE INTEGERS) (THEORY). Life is somewhat 
different over a (small) finite field, but we postpone discussion of that situation 
until we have considered the integers. Results about linear combinations with 
positive integer coefficients do not follow trivially from results about fields, but 
they do follow: 


Theorem 7. Each n × n semi-magic square can be expressed as a linear combination,
with positive integer coefficients, of $n^2 - 2n + 2$ or fewer permutation matrices.


Proof: We follow the argument by which Marcus and Ree [5] proved that every
doubly-stochastic matrix is a convex combination of $n^2 - 2n + 2$ or fewer
permutation matrices. Let A be a non-zero n × n semi-magic square (if A = 0, there is
nothing to prove). By Theorem 2 we know there is a permutation matrix $P_1$ such
that $A - P_1$ has non-negative integer entries. Choose $m_1$ as large as possible,
subject to $A_1 = A - m_1 P_1$ having non-negative entries. Note that $P_1$ has a one in
some spot where $A_1$ has a zero and that the magic constant of $A_1$ is strictly less
than that of A. Now apply the same procedure to $A_1$, and iterate to termination.
Termination must occur, since the magic constants form a strictly decreasing
sequence of non-negative integers. When the procedure terminates, we have
$A = m_1 P_1 + \cdots + m_r P_r$ for some r. But the matrices $P_1, \dots, P_r$ are linearly
independent (over, say, the rationals), since each has a one in a spot where its
successors all have zero. So, r is no greater than the dimension of the space
spanned by all the n × n permutation matrices, and we know from Section 6 that
this dimension is $n^2 - 2n + 2$. ∎
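The procedure in this proof is an algorithm, and a minimal sketch of it follows. The proof only needs some permutation matrix supported on the non-zero entries; here we assume one is found by augmenting paths, which is our choice and not something the paper specifies. The names `support_permutation` and `greedy_decomposition` are hypothetical.

```python
def support_permutation(A):
    """Find p with A[i][p[i]] > 0 for every row i, or None if there is none."""
    n = len(A)
    row_of = [None] * n  # column -> row currently matched to it

    def augment(i, seen):
        for j in range(n):
            if A[i][j] > 0 and j not in seen:
                seen.add(j)
                if row_of[j] is None or augment(row_of[j], seen):
                    row_of[j] = i
                    return True
        return False

    for i in range(n):
        if not augment(i, set()):
            return None
    p = [None] * n
    for j, i in enumerate(row_of):
        p[i] = j
    return tuple(p)

def greedy_decomposition(A):
    """Return [(m_1, P_1), ..., (m_r, P_r)] with A = sum of m_k * P_k, each m_k > 0."""
    n = len(A)
    R = [row[:] for row in A]
    terms = []
    while any(x != 0 for row in R for x in row):
        p = support_permutation(R)
        if p is None:
            raise ValueError("input is not a semi-magic square")
        m = min(R[i][p[i]] for i in range(n))  # largest multiple we can remove
        for i in range(n):
            R[i][p[i]] -= m
        terms.append((m, p))
    return terms
```

Each pass zeroes at least one entry, which is what makes the chosen permutation matrices linearly independent in the proof; the number of terms returned is therefore at most n² − 2n + 2, though it need not be the minimum possible.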


We would like to know whether there is an “integer proof” of Theorem 7, that 
is, a proof that does not rely on embedding the integers into a field and using 
dimension, a vector space concept. 


Theorem 8. For every n there exist n × n semi-magic squares that cannot be expressed
as a linear combination, with non-negative integer coefficients, of $n^2 - 2n + 1$
permutation matrices.




We give three proofs. 


First proof: Let A be an n × n constant line-sum matrix with non-negative rational
entries, and assume that A is not a linear combination with rational coefficients of
$n^2 - 2n + 1$ permutation matrices. Such matrices exist by a corollary to the
technical lemma in the appendix. Let m be a common multiple of the
denominators of the entries of A. Then mA is a semi-magic square, and is not expressible as
a linear combination with rational coefficients (nor, a fortiori, with non-negative
integer coefficients) of $n^2 - 2n + 1$ permutation matrices. For, if there were such
an expression for mA, then dividing through by m would give an expression for A
as a rational linear combination of $n^2 - 2n + 1$ permutation matrices. ∎


Second proof: We count the number of n × n semi-magic squares with magic
constant N, and the number of linear combinations of $n^2 - 2n + 1$ permutation
matrices with positive integer coefficients adding up to N, and we see that, if N is
large enough, there are too many of the former to be accounted for by the latter.

Given integers $a_{ij}$ with $N(n-2)/(n-1)^2 \le a_{ij} \le N/(n-1)$ for $1 \le i \le n-1$
and $1 \le j \le n-1$, there exist non-negative integers $a_{in}$, $1 \le i \le n$, and $a_{nj}$,
$1 \le j < n$, such that A is a semi-magic square with magic constant N. Thus, the
number of squares with magic constant N is at least $c_1 N^{(n-1)^2}$. Here and in the
following discussion $c_1, c_2, \dots$ depend on n but not on N, and the exact nature of
the dependence is irrelevant.

To count the number of non-negative integer linear combinations of $n^2 - 2n + 1$
permutation matrices, with all line-sums equal to N, we note first that there are
$\binom{n!}{n^2 - 2n + 1} = c_2$ ways of choosing the permutation matrices. Having chosen them, we
have only to count the number of expressions $\sum_{j=1}^{n^2 - 2n + 1} a_j P_j$ subject to the
conditions $\sum a_j = N$ and $a_j \ge 0$ for all j. But the number of ways to meet the conditions
is $\binom{N + n^2 - 2n}{n^2 - 2n}$, which is a polynomial in N of degree $(n-1)^2 - 1$ and is thus
bounded above by $c_3 N^{(n-1)^2 - 1}$ for some $c_3$. So, the total number of semi-magic
squares with magic constant N representable as non-negative integer linear
combinations of $n^2 - 2n + 1$ permutation matrices is at most $c_4 N^{(n-1)^2 - 1}$, where $c_4 = c_2 c_3$.
If N is large enough, $c_1 N^{(n-1)^2} > c_4 N^{(n-1)^2 - 1}$, so there must be semi-magic
squares that cannot be expressed as a non-negative integer linear combination of
$n^2 - 2n + 1$ permutation matrices. ∎


We could use this second proof to estimate the value of N needed, but we have 
thrown too much away for the estimate to be any good. Our third proof actually 
constructs the object whose existence is established by the first two proofs. 


Third proof: Let $P_1, \dots, P_d$, $d = n^2 - 2n + 2$, be the special basis $\mathcal{B}_n$ discussed in
Section 5, ordered in such a way that all the 3-cycles come first, then the
transpositions, finally, the identity. Let $A = \sum_{j=1}^{d} c_j P_j$, where $c_j$ is any sequence of
positive integers growing fast enough to satisfy $c_j > \sum_{k=1}^{j-1} (j - k) c_k$ for all j (the
sequence 1, 2, 5, 13, 34, ... of alternate Fibonacci numbers will do, barely). We
claim that A cannot be expressed as a positive integer linear combination of fewer
than $n^2 - 2n + 2$ permutation matrices.

Recall that each $P_j$ has a “special spot” where it has a one and where each $P_k$,
k > j, has a zero. Given any j, and any matrix B, we write B(j) for the entry of B
in the special spot of $P_j$.




Let $A = \sum_{j=1}^{r} a_j Q_j$ for some positive integers $a_j$ and some permutation matrices
$Q_j$. Since $A(1) = c_1 \ge 1$, we must have $Q_j(1) = 1$ for some j. Re-ordering, if
necessary, we may assume $Q_1(1) = 1$. It follows that $a_1 \le c_1$. Let $A_1 = A - a_1 Q_1$.

Now suppose that for $1 \le j \le k - 1$ we have $Q_j(j) = 1$, $1 \le a_j \le c_1 + \cdots + c_j$,
and $A_j = A_{j-1} - a_j Q_j = A - a_1 Q_1 - \cdots - a_j Q_j$. Note that $c_k \le A(k) \le c_1 + \cdots + c_k$. It
follows that

$$1 \le c_k - \sum_{j=1}^{k-1} (k - j) c_j \le c_k - \sum_{j=1}^{k-1} a_j \le A_{k-1}(k) \le c_1 + \cdots + c_k.$$

Since $A_{k-1}(k) \ge 1$, we must have $Q_j(k) = 1$ for some $j \ge k$. Re-ordering, if
necessary, we may assume $Q_k(k) = 1$. Then $1 \le a_k \le c_1 + \cdots + c_k$.

By induction, we see that $Q_j(j) = 1$ for $1 \le j \le n^2 - 2n + 2$, and
$r \ge n^2 - 2n + 2$. ∎


8. HOW MANY? (NON-NEGATIVE INTEGERS) (PRACTICE). Let’s look at 
some numerical examples. The third proof of Theorem 8, in the case n = 3, 
produces the semi-magic square 


$$\begin{pmatrix} 34 & 6 & 15 \\ 7 & 47 & 1 \\ 14 & 2 & 39 \end{pmatrix}$$


with magic constant 55, so this matrix cannot be written as a positive integer linear 
combination of fewer than 5 permutation matrices. But the same is true of the 
semi-magic square 


$$\begin{pmatrix} 1 & 3 & 3 \\ 3 & 2 & 2 \\ 3 & 2 & 2 \end{pmatrix}$$


with magic constant 7. For to account for the entry in the upper left corner, either 
the identity or (23) must be involved. By symmetry, it doesn’t matter which, so let’s 
assume J is a summand. Subtracting J leaves a one in the (2,2) position, which 
forces involvement of (13), and a one in the (3,3) position, which forces involve- 
ment of (12). Subtracting these leaves a matrix with two non-zero entries in each 
line, so two more matrices are needed; in total, 5. 

By brute force, one can show that this result is sharp, that is, that every 3 x 3 
semi-magic square with magic constant less than 7 can be written as a positive 
integer linear combination of 4 or fewer permutation matrices. 
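The 3 × 3 claims in this section are small enough to check exhaustively. The sketch below is our own illustration, with hypothetical names throughout: `third_proof_square` rebuilds the square with magic constant 55 from the ordered basis (3-cycles, then transpositions, then the identity) and the coefficients 1, 2, 5, 13, 34; `min_terms` searches over all non-negative coefficient choices for the six 3 × 3 permutation matrices and returns the fewest non-zero coefficients needed; `squares` enumerates all 3 × 3 semi-magic squares with a given magic constant.

```python
from itertools import permutations

PERMS = list(permutations(range(3)))  # p[i] = column of the one in row i

def third_proof_square():
    """Sum of c_j * P_j with P_j = (123), (132), (12), (13), identity (0-based tuples)."""
    basis = [(1, 2, 0), (2, 0, 1), (1, 0, 2), (2, 1, 0), (0, 1, 2)]
    A = [[0] * 3 for _ in range(3)]
    for c, p in zip([1, 2, 5, 13, 34], basis):
        for i in range(3):
            A[i][p[i]] += c
    return A

def min_terms(A):
    """Fewest permutation matrices in a positive integer combination equal to A."""
    best = None

    def search(idx, R, used):
        nonlocal best
        if best is not None and used >= best:
            return  # cannot improve on the best count found so far
        if idx == len(PERMS):
            if all(x == 0 for row in R for x in row):
                best = used
            return
        p = PERMS[idx]
        cap = min(R[i][p[i]] for i in range(3))
        for c in range(cap + 1):
            S = [row[:] for row in R]
            for i in range(3):
                S[i][p[i]] -= c
            search(idx + 1, S, used + (1 if c > 0 else 0))

    search(0, A, 0)
    return best

def squares(s):
    """All 3 x 3 semi-magic squares with magic constant s."""
    for a in range(s + 1):
        for b in range(s + 1):
            for c in range(s + 1):
                for d in range(s + 1):
                    A = [[a, b, s - a - b],
                         [c, d, s - c - d],
                         [s - a - c, s - b - d, a + b + c + d - s]]
                    if all(x >= 0 for row in A for x in row):
                        yield A
```

With these helpers, `min_terms` can be applied to the two squares displayed above and to every square produced by `squares(s)` for s below 7, which is one way to carry out the brute-force check mentioned in the text.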

Theorems 7 and 8 imply that every 4 × 4 semi-magic square can be written as a
non-negative integer linear combination of 10 permutation matrices, and that there
exist 4 × 4 semi-magic squares that cannot be written as a non-negative integer
linear combination of 9 permutation matrices. A 4 × 4 semi-magic square that
requires 10 permutations is

$$A = \begin{pmatrix} 5 & 5 & 7 & 14 \\ 11 & 18 & 1 & 1 \\ 10 & 3 & 16 & 2 \\ 5 & 5 & 7 & 14 \end{pmatrix}$$

with magic constant 31; we have been unable to find an example with a smaller
magic constant. The proof that this semi-magic square requires 10 permutations
reveals a method for producing, for any n, an n × n semi-magic square that
cannot be represented by (that is, written as a non-negative linear combination of)
fewer than $n^2 - 2n + 2$ permutation matrices.




Let $A = \sum_{j=1}^{r} a_j Q_j$ with positive integers $a_j$ and permutation matrices $Q_j$. Since
A(2, 3) = 1, there must be some j such that $Q_j(2, 3) = 1$. We may assume $Q_1(2, 3) = 1$.
Then $a_1 = 1$. Let $A_1 = A - a_1 Q_1$.

Now A(2, 4) = 1 and $Q_1(2, 4) = 0$ (since $Q_1(2, 3) = 1$; this is a refinement in
the reasoning of the third proof of Theorem 8). So $A_1(2, 4) = 1$, and we may
assume $Q_2(2, 4) = 1$ and $a_2 = 1$. Let $A_2 = A_1 - a_2 Q_2$.

Since $1 \le A_2(3, 2) \le 3$, we may assume $Q_3(3, 2) = 1$ and $1 \le a_3 \le 3$.

By similar reasoning we find $Q_4(3, 4) = Q_5(4, 2) = Q_6(4, 3) = 1$, $1 \le a_4 \le 2$,
$1 \le a_5 \le 5$, and $1 \le a_6 \le 7$. Let $A_6 = A - \sum_{j=1}^{6} a_j Q_j$.

Now comes the tricky part: showing that $A_6(1, 1) \ge 1$ (whence $Q_j(1, 1) = 1$ for
some $j \ge 7$). If $Q_3(1, 1) = 1$ then, since $Q_3$ is a permutation matrix and $Q_3(3, 2) = 1$,
we must have $Q_3(2, 3) = 1$ or $Q_3(2, 4) = 1$. Thus, $Q_3(1, 1) \le Q_3(2, 3) + Q_3(2, 4)$.
Similarly, $Q_5(1, 1) \le Q_5(2, 3) + Q_5(2, 4)$ and $Q_6(1, 1) \le Q_6(2, 4) + Q_6(3, 4)$. It
follows that

$$
\begin{aligned}
\sum_{j=1}^{6} a_j Q_j(1, 1) &\le \bigl(a_3 Q_3(2, 3) + a_5 Q_5(2, 3) + a_1\bigr) \\
&\quad + \bigl(a_3 Q_3(2, 4) + a_5 Q_5(2, 4) + a_6 Q_6(2, 4) + a_2\bigr) \\
&\quad + \bigl(a_6 Q_6(3, 4) + a_4\bigr) \\
&\le A(2, 3) + A(2, 4) + A(3, 4) = 4.
\end{aligned}
$$

Since A(1, 1) = 5, we have established $A_6(1, 1) \ge 1$. With the obvious definitions,
the same sort of reasoning shows that $A_7(1, 2)$, $A_8(1, 3)$, and $A_9(1, 4)$ are all
positive, so $r \ge 10$.

We can prove that any 4 × 4 square with magic constant 14 or less can be
written with fewer than 10 permutation matrices, but we have been unable to close
the gap between 14 and 31, or the much larger gaps in our knowledge for n > 4.


9. HOW MANY? (SMALL MODULI). Let q be a positive integer, and let A be
an n × n constant line-sum matrix over Z/qZ. We showed in Section 6 that A can
be expressed as a Z/qZ-linear combination of no more than $n^2 - 2n + 2$
permutation matrices. If q is not too big (relative to n), we can do better.


Theorem 9. Any n × n constant line-sum matrix over Z/qZ can be written as a
Z/qZ-linear combination of no more than (q − 1)n permutation matrices.


We note that (q − 1)n is less than $n^2 - 2n + 2$, provided $q \le n - 1$. We
illustrate Theorem 9 with an example before embarking on the proof. Working
over Z/3Z, consider

$$A = \begin{pmatrix} 2 & 2 & 2 & 2 \\ 1 & 1 & 1 & 2 \\ 1 & 0 & 0 & 1 \\ 1 & 2 & 2 & 0 \end{pmatrix}. \tag{2}$$


We can construct a semi-magic square A′ that is congruent to A (modulo 3), with
magic constant 8 = (q − 1)n:

A′ = [4 × 4 matrix; its entries are not recoverable from this copy]

Trivially, A′ can be written as a sum of 8 permutation matrices; this serves to
express A as a Z/3Z-linear combination of 8 permutation matrices.




Proof of Theorem 9: Let A be an n × n constant line-sum matrix over Z/qZ. We
may view the entries of A as integers $a_{ij}$ satisfying $0 \le a_{ij} \le q - 1$. Working now
in Z, let the maximal line sum in A be m; note that $m \le (q - 1)n$. We now
construct a semi-magic square A′, congruent, entrywise, to A (modulo q), with
magic constant m. Choose any row of A whose entries do not add up to m (if
there is no such row, A is already semi-magic), and any column of A whose entries
do not add up to m. Where the chosen row and column intersect, add to the entry
a large enough multiple of q to bring the larger of the row and column sums up to
m. This does not change the congruence class of the entry (modulo q), and
it decreases by at least one the number of lines with line-sum not equal to m.
After at most 2n − 2 applications of this procedure we arrive at a semi-magic
square, A′.

Now A′ is a semi-magic square with magic constant m, so it can certainly be
written as a sum of m permutation matrices. As corresponding entries in A′ and
A are congruent (modulo q), the same m permutation matrices sum to A when
viewed over Z/qZ. Since $m \le (q - 1)n$, we are done. ∎
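The lifting step in this proof can be sketched directly. This is a minimal illustration under our own naming (`lift_to_semimagic` is hypothetical), and it always picks the first deficient row and the first deficient column, a choice the proof leaves open, so the square it returns is one valid lift among several.

```python
def lift_to_semimagic(A, q):
    """Return (L, m): a semi-magic square L congruent to A mod q, magic constant m."""
    n = len(A)
    L = [[x % q for x in row] for row in A]

    def row_sum(i):
        return sum(L[i])

    def col_sum(j):
        return sum(L[i][j] for i in range(n))

    sums = [row_sum(i) for i in range(n)] + [col_sum(j) for j in range(n)]
    if len({s % q for s in sums}) != 1:
        raise ValueError("input does not have constant line-sums modulo q")
    m = max(sums)
    while True:
        low_rows = [i for i in range(n) if row_sum(i) != m]
        low_cols = [j for j in range(n) if col_sum(j) != m]
        if not low_rows or not low_cols:
            break
        i, j = low_rows[0], low_cols[0]
        # The gap is a multiple of q because every line-sum is congruent to m.
        L[i][j] += m - max(row_sum(i), col_sum(j))
    return L, m
```

For a 4 × 4 matrix over Z/3Z whose reduced entries have maximal line-sum 8, the function returns a semi-magic square with magic constant 8, which by Theorem 3 is a sum of 8 permutation matrices.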


In the case q = 2, Theorem 9 is best possible, since it is clear that the n × n
all-ones matrix requires n permutation matrices. In other cases, we can often do
better; if q is not a prime, we can always do better. It helps to introduce some
notation here. Let $\beta(A, q)$ denote the least r such that A can be written as a
Z/qZ-linear combination of r permutation matrices, and let $\beta(n, q)$ denote the
maximum value of $\beta(A, q)$ over all n × n constant line-sum matrices A. In this
notation, Theorem 9 says $\beta(n, q) \le (q - 1)n$.


Theorem 10. Let s and t be integers, and let A be an n × n constant line-sum matrix
over Z/stZ. Then $\beta(A, st) \le \beta(A, s) + \beta(n, t)$.


Proof: Let $\beta(A, s) = k$, so $A = \sum_{i=1}^{k} c_i P_i + sA_1$ for some integers $c_1, \dots, c_k$, some
permutation matrices $P_1, \dots, P_k$, and some constant line-sum matrix $A_1$. Then
we see that $A_1 = \sum_{j=1}^{l} d_j Q_j + tA_2$ for some integers $d_1, \dots, d_l$, some
permutation matrices $Q_1, \dots, Q_l$, and some constant line-sum matrix $A_2$, with
$l \le \beta(n, t)$. Then

$$A \equiv \sum_{i=1}^{k} c_i P_i + \sum_{j=1}^{l} s d_j Q_j \pmod{st},$$

and $k + l \le \beta(A, s) + \beta(n, t)$. ∎


Corollary 11. Let the factorization of q into powers of distinct primes be
$q = p_1^{e_1} \cdots p_r^{e_r}$. Then

$$\beta(n, q) \le \sum_{i=1}^{r} e_i\, \beta(n, p_i) \le \sum_{i=1}^{r} e_i (p_i - 1)\, n. \tag{3}$$


If q is not a prime then (3) is always an improvement over the bound in
Theorem 9. We can often make a small improvement on the bound (3), even for
prime q. Rather than state the result in its full (and somewhat tedious) generality,
we illustrate its application to 4 × 4 matrices over Z/3Z by establishing that
$\beta(4, 3) \le 7$; Theorem 9 allows us to conclude only that $\beta(4, 3) \le 8$. Let A be any
4 × 4 constant line-sum matrix over Z/3Z that, when viewed as an integer matrix,
has maximal line-sum 8; for example, the matrix (2). Then 2A has all line-sums
congruent to 1 (mod 3), thus, maximal line-sum at most 7 (when viewed as an
integer matrix). In our example,

$$2A = \begin{pmatrix} 1 & 1 & 1 & 1 \\ 2 & 2 & 2 & 1 \\ 2 & 0 & 0 & 2 \\ 2 & 1 & 1 & 0 \end{pmatrix}.$$

By the procedure of the proof of Theorem 9, 2A can be expressed, over Z/3Z, as
a sum of 7 permutation matrices. Multiplication by 2 yields an expression for A as
a Z/3Z-linear combination of 7 permutation matrices, whence $\beta(4, 3) \le 7$.

With a bit more work, we can actually prove $\beta(4, 3) = 6$. For it follows from the
work of Marcus and Minc [4] that if B is a 4 × 4 semi-magic square with magic
constant 7, then there is a permutation matrix P such that B − 2P has
non-negative entries. Since B − 2P is a semi-magic square with magic constant 5, B is a
positive integer linear combination of 6 or fewer permutation matrices. Thus, the
number of permutation matrices necessary to represent a 4 × 4 constant line-sum
matrix over Z/3Z is at most 6, which is best possible:

$$\begin{pmatrix} 1 & 1 & 1 & 0 \\ 1 & 1 & 1 & 0 \\ 2 & 2 & 2 & 0 \\ 2 & 2 & 2 & 0 \end{pmatrix}$$

cannot be written as a Z/3Z-linear combination of fewer than 6 permutation
matrices (exercise for the reader). The general question of evaluating $\beta(n, q)$
appears to be very intricate.


10. APPENDIX. We present a result about vector spaces that is somewhat techni- 
cal, together with two useful corollaries. We would like to thank Bruce Reznick for 
suggestions that improved the exposition in the proof of this lemma. 


Lemma 12. Let V be a vector space over a field F. Let $v_1, \dots, v_d$ and z be in V, and
let $W_1, \dots, W_m$ be subspaces of V. Assume that no $W_j$ contains the subspace $V_0$
generated by $\{v_1, \dots, v_d\}$. Let $S \subseteq F$ be any set with m + 1 or more elements. Then
there is a vector v in V that can be written as $v = a_1 v_1 + \cdots + a_d v_d + z$ with each $a_i$ in
S, but v is not in $W_1 \cup \cdots \cup W_m$.


Corollary 13. No vector space V over an infinite field F is a finite union of proper 
subspaces. 


Proof: Let $W_1, \dots, W_m$ be proper subspaces of V. Choose $v_i$ in V such that $v_i$ is
not in $W_i$ for $1 \le i \le m$. Now apply Lemma 12, with S = F and z = 0. ∎


Corollary 14. For every n there is an n × n constant line-sum matrix with
non-negative rational entries that is not a rational linear combination of $n^2 - 2n + 1$
permutation matrices.


Proof: In Lemma 12, let F be the rationals, and let S be the non-negative
rationals. Take $d = n^2 - 2n + 2$, and let $v_1, \dots, v_d$ be a linearly independent set
of permutation matrices. Let V be the span of $\{v_1, \dots, v_d\}$, which is the space of all
n × n constant line-sum matrices with rational entries. Let $W_1, \dots, W_m$ be the
subspaces generated by sets of $n^2 - 2n + 1$ permutation matrices, one subspace
for each set of permutation matrices. Lemma 12 ensures that there exist $a_1, \dots, a_d$,
all non-negative rationals, such that $v = a_1 v_1 + \cdots + a_d v_d$ is not in any $W_j$. This v
is a constant line-sum matrix with non-negative rational entries, and is not a
rational linear combination of $n^2 - 2n + 1$ permutation matrices. ∎


Proof of Lemma 12. We may assume that $v_1, \dots, v_d$ are linearly independent, for,
if $v_1, \dots, v_r$ are linearly independent, and $v_{r+1}, \dots, v_d$ are dependent on $v_1, \dots, v_r$,
we may choose $a_{r+1}, \dots, a_d$ arbitrarily from S, let $z' = a_{r+1} v_{r+1} + \cdots + a_d v_d + z$,
and find a vector v that can be written as $v = a_1 v_1 + \cdots + a_r v_r + z'$.

For each j, let $X_j = W_j \cap V_0$. Then $X_1, \dots, X_m$ are proper subspaces of $V_0$.
We may assume that S has exactly m + 1 elements, and let
$T = \{\sum_{i=1}^{d} a_i v_i + z : a_i \in S\}$, so T has cardinality $(m + 1)^d$. We wish to conclude that T is not
contained in $X_1 \cup \cdots \cup X_m$.

In fact, we prove that $\#(X_j \cap T) \le (m + 1)^{d-1}$, from which it follows that

$$\#\bigl((X_1 \cup \cdots \cup X_m) \cap T\bigr) \le \sum_j \#(X_j \cap T) \le m(m + 1)^{d-1} < (m + 1)^d = \#(T).$$

For, suppose $\#(X_1 \cap T) > (m + 1)^{d-1}$. Then for each k, $1 \le k \le d$, the
pigeonhole principle implies that there exist $c_1, \dots, c_{k-1}, c_{k+1}, \dots, c_d$ in S such that
$c_1 v_1 + \cdots + b v_k + \cdots + c_d v_d + z$ is in $X_1$ for two distinct elements b of S, say,
$b = b_1$ and $b = b_2$. Then $(b_1 - b_2) v_k$ is in $X_1$, hence $v_k$ is in $X_1$. But this is true
for each k, contradicting the hypothesis that $X_1$ is a proper subspace of $V_0$. The
same argument applies to each $X_j$. ∎


REFERENCES 


1. G. Birkhoff, Tres observaciones sobre el algebra lineal, Univ. Nac. Tucumán Rev. Ser. A 5 (1946)
147-151.

2. Ira Gessel, Gian-Carlo Rota, eds., Classic Papers in Combinatorics, Birkhäuser, 1987.

3. D. König, Über Graphen und ihre Anwendung auf Determinantentheorie und Mengenlehre, Math.
Annalen 77 (1915-6) 453-465.

4. M. Marcus, H. Minc, Some results on doubly-stochastic matrices, Proc. Amer. Math. Soc. 13 (1962)
571-579.

5. M. Marcus, R. Ree, Diagonals of doubly stochastic matrices, Quart. J. Math. 10 (1959) 295-302.

6. H. J. Ryser, Combinatorial Mathematics, Mathematical Association of America, 1963.


DAVID LEEP attended MIT and Michigan and had postdoc positions at Chicago and Berkeley. Now at 
the University of Kentucky, his research interests include quadratic forms, number theory, finite fields, 
and occasional dabbling in algebraic geometry. His outside interests include Baroque trumpet music, 
traveling, and day dreaming. 

University of Kentucky, Lexington, KY 40506-0027, USA 

leep@ms.uky.edu 


GERRY MYERSON attended Harvard, Stanford, Cambridge, and Michigan. It was at Michigan that he 
met David Leep. His first publication, joint work with David and fellow Michigan student Brian Conrey, 
was Advanced Problem 6200 in the MONTHLY, March, 1978. It has taken only a bit over 20 years for him 
to team up with David again. He arrived at Macquarie University on the day supernova SN 1987a was 
detected in the Large Magellanic Cloud, but attributes no significance to this coincidence. He enjoys 
folk music, baseball, and writing about himself in the third person. 

Centre for Number Theory Research, E7A, Macquarie University, NSW 2109 Australia 
gerry@mpce.mq.edu.au 


1999] MARRIAGE, MAGIC, AND SOLITAIRE 429 


The Isoperimetric Problem on Surfaces 


Hugh Howards, Michael Hutchings, and Frank Morgan 


1. INTRODUCTION. The isoperimetric problem on a surface is to enclose a given 
area with the shortest possible curve. The classical isoperimetric theorem asserts 
that in the plane the unique solution is a circle. On curved surfaces the isoperimet- 
ric problem is harder and much remains open. Even on the simplest paraboloid the 
“obvious” solution was proved only in 1996 by Benjamini and Cao ([2, Thms. 5, 8];
see also [24, Prop. 7], [22, Thm. 3.1], [30, Thm. 1], [29], [26]): 


Theorem 1.1 (Benjamini and Cao). The unique least-perimeter way to enclose given 
area in the paraboloid of revolution 


$$P = \{z = x^2 + y^2\} \subset \mathbf{R}^3 \qquad (1.1)$$


is a horizontal circle {z = c}. 


This article gives our three favorite proofs of the classical isoperimetric theorem 
in the plane and then presents some recent results on other surfaces, including a 
new proof for the paraboloid. Section 2 uses an amazingly simple symmetry 
argument to show that a nice minimizer must be a circle. Unfortunately this 
approach needs to assume that a nice minimizer exists. Section 3 gives a very 
simple, complete proof without assuming a nice minimizer exists, following the 
undergraduate thesis of Howards [15]. Section 4 provides another complete proof, 
a slight twist on a magical proof of Gromov [10]. 

In general surfaces the existence of a nice, one-component perimeter-minimiz- 
ing curve has been astonishingly problematic. Fortunately a relatively easy ap- 
proach is now available from [12], as explained in Section 5. One has to allow the 
curve to bump up against itself. 

Sections 6-8 solve the isoperimetric problem for cylinders, cones, flat tori, and 
Klein bottles. Section 9 treats the paraboloid and certain other surfaces of 
revolution. Section 10 discusses hyperbolic surfaces. 

This work was partly inspired by a more difficult question we heard from 
J. C. C. Nitsche about the soap film between a large wire boundary and a small,
moveable loop of thread. The thread wants to position itself to minimize the area 
of the soap film outside it. If the thread were constrained to lie in a fixed surface 
bounded by the wire (which unfortunately is not the case), then the thread would 
want to be an isoperimetric curve in that surface. 

Osserman [23] provides a marvelous survey on the isoperimetric inequality. 


2. THE CIRCLE IN THE PLANE, ASSUMING SMOOTH EXISTENCE. We 
assume that there is a compact minimizer C among smooth curves of finitely many 
components and enclosed area $\pi$, and use symmetry to prove it must be a single
round unit circle; existence is a nontrivial assumption, a fact overlooked by some 
early workers. The proof uses a symmetry argument we heard from Brian White 
and Luen-fai Tam, who thought it originated with Blaschke (see [9, Thm. 3.4], 
[17, Thm. 5.3], and [16, §2]); we have been unable to trace its origin and would be 
grateful to anyone who could help. 


430 THE ISOPERIMETRIC PROBLEM ON SURFACES [May 


Suppose C is not a round circle. Take a horizontal line splitting the enclosed 
area in half. Each half must have the same length, or the shorter half, together 
with its reflection, would be shorter than C. Replacing C by half plus its reflection 
if necessary, we may assume that C is symmetric across the horizontal line. 
Similarly we may assume that C is symmetric across a vertical line. We may 
assume the lines meet at the origin. Now C is symmetric under the composition of 
the two reflections, i.e., under 180-degree rotation around the origin. Hence every
line through the origin splits the area in half. C must meet every line through the 
origin orthogonally; otherwise, one half of C, together with its reflection, would 
not be convex, and its convex hull would have less perimeter and more area. It 
follows that C consists of circles about the origin. A single circle is best. We 
conclude that the original C is a round circle. 

This argument can be generalized to prove that a round hypersphere is
perimeter-minimizing for given volume in $\mathbf{R}^n$, in the round sphere $S^n$, and in
hyperbolic space $H^n$. More generally, it shows that a minimizing cluster of k
bubbles enclosing $k < n$ prescribed volumes in $\mathbf{R}^n$ has $O(n - k + 1)$ symmetry,
assuming known but difficult existence and regularity [16, Thm. 2.6]. It played an
essential role in the recent proof by Hass, Hutchings, and Schlafly of the equal
volumes case of the still open Double Bubble Conjecture, which says that the
familiar standard double soap bubble is the least-area way to enclose and separate
two given volumes of air ([11], [16], [14], [18], [13]).


3. THE CIRCLE IN THE PLANE, WITHOUT ASSUMING EXISTENCE. To 
prove that the circle is perimeter-minimizing (but not necessarily uniqueness), by 
approximation it suffices to show that the shortest n-gon enclosing given area is
the regular n-gon. In his undergraduate thesis, Howards [15] gave the following 
geometric proof free of variational calculus, including ideas that we have since 
traced back to Zenodorus about 200 BC, Steiner in 1838 ([27], [28, p. 105 and Fig.
6]), and Courant and Hilbert [6, p. 166]; see the interesting “A history of the 
classical isoperimetric problem” by Porter [25] and Bonnesen and Fenchel [5, §57]. 

By compactness, there is a shortest n-gon in the 2n-dimensional space of 
vertices. It is convex. Consider two adjacent sides, which determine a triangle, and 
the line L through the common vertex and parallel to the third side of the triangle. 
These two sides must constitute the shortest path to L and back, since all such 
constructions yield triangles of the same area. The first side, together with the 
reflection of the second across L, must form a straight line. Hence the two sides 
have the same length. Therefore the n-gon is equilateral. 

To prove that the equilateral n-gon is regular, we begin with n even. For
opposite vertices P, Q, the line PQ must have the same area above as below, or a 
reflection of the larger half would enclose more area (or, scaled down, the same 
area with less length). For an intermediate vertex M, the angle PMQ must be 90°, 
or replacing it with a 90° angle and reflecting as in Figure 3.1 would increase the 
area enclosed. Therefore the n-gon is inscribed in a circle and must be regular. 

Finally suppose n is odd. A regular 2n-gon comes from putting little triangles
on the sides of the regular n-gon. If a perimeter-minimizing n-gon, known to be 
equilateral, had more area than a regular n-gon with the same sides, putting those 
little triangles on its sides would yield a non-regular 2n-gon with more area than 
the regular 2n-gon, the final contradiction. 

Figure 3.1. The angle PMQ must be 90°, or replacing it with a 90° angle and reflecting would increase
the area enclosed.


This completes the proof that the circle is perimeter minimizing. In fact, now
that we know that a minimizer exists, we can use the above arguments to prove
uniqueness. Consider any minimizer. It must be convex. As above, a line bisecting
its perimeter must bisect the area, and any inscribed angle must be 90°. Therefore,
the minimizer must be a circle.
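
The approximation step can be illustrated numerically. The following is a minimal sketch, not part of the original argument: it computes the isoperimetric quotient $4\pi A/L^2$ (which equals 1 for a circle) for regular n-gons and for an arbitrary polygon. The function names are hypothetical and chosen only for this illustration.

```python
import math

def regular_ngon_quotient(n: int) -> float:
    """Isoperimetric quotient 4*pi*A / L**2 of the regular n-gon."""
    # For side length s: L = n*s and A = n*s**2 / (4*tan(pi/n)); s cancels.
    return math.pi / (n * math.tan(math.pi / n))

def polygon_quotient(vertices) -> float:
    """Isoperimetric quotient of a simple polygon given as (x, y) pairs."""
    n = len(vertices)
    area2 = 0.0    # twice the signed area (shoelace formula)
    length = 0.0
    for i in range(n):
        x0, y0 = vertices[i]
        x1, y1 = vertices[(i + 1) % n]
        area2 += x0 * y1 - x1 * y0
        length += math.hypot(x1 - x0, y1 - y0)
    return 4 * math.pi * (abs(area2) / 2) / length ** 2

if __name__ == "__main__":
    # The quotient of the regular n-gon increases toward 1 as n grows.
    for n in (3, 4, 6, 12, 100):
        print(n, regular_ngon_quotient(n))
    # A 2-by-1 rectangle does worse than the square (the regular 4-gon).
    print(polygon_quotient([(0, 0), (2, 0), (2, 1), (0, 1)]))
```

The quotient of the regular n-gon is $\pi/(n\tan(\pi/n))$, which increases to 1, consistent with the circle being the limiting minimizer.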


4. THE CIRCLE IN THE PLANE, ANOTHER PROOF WITHOUT ASSUMING 
EXISTENCE. Gromov ([10]; see [3, 12.11.4] or [20, 10.5]) gave a proof of the
isoperimetric theorem in $\mathbf{R}^n$ by direct comparison. The strategy in $\mathbf{R}^2$, for example,
is to find a vectorfield v on any competing region R of area $\pi$ with smooth
boundary C and outward unit normal n such that


$$\operatorname{div}(v) \ge 2, \qquad (4.1)$$
$$v \cdot n \le 1, \qquad (4.2)$$


with equality everywhere only if R is a disc. If such a v can be found, then the 
isoperimetric inequality follows immediately from Stokes’ theorem: 


$$\operatorname{length}(C) \ge \int_C v \cdot n = \int_R \operatorname{div}(v) \ge 2\operatorname{area}(R) = 2\pi,$$


with equality only if R is a disc. 
The Gromov proof finds such a v by a very clever construction, but the resulting 
v is not canonical. We now show that there is a canonical such v when n = 2. 
The canonical v is the negative of the gravitational field induced by a substance 
of constant density filling the region R. More precisely, 


$$v(x) = \frac{1}{\pi} \int_{y \in R} \frac{x - y}{|x - y|^2} \, dy.$$


By the two-dimensional analog of Gauss’s law, div(v) = 2 in R, so it now suffices 
to prove (4.2). 
Fix a point x in the boundary, and choose polar coordinates $(r, \theta)$ around x so
that n points in the direction $\theta = \pi$. Then

$$v(x) \cdot n = \frac{1}{\pi} \int_{y \in R} \frac{\cos\theta}{r} \, dy.$$


Since the area of R is fixed, this integral is maximized if we put the points of R
where $(\cos\theta)/r$ is largest. The level sets of $(\cos\theta)/r$ are circles tangent to C at x,
with smaller circles giving larger values of $(\cos\theta)/r$. So clearly a disc of the given
area uniquely maximizes the integral, completing the proof.
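
As a check of the equality case (a worked computation added here for illustration, under the assumption that R is the unit disc, so that in the polar coordinates around x it is the set $0 \le r \le 2\cos\theta$, $|\theta| \le \pi/2$, with area element $dy = r\,dr\,d\theta$):

```latex
\[
v(x)\cdot n
  = \frac{1}{\pi}\int_{-\pi/2}^{\pi/2}\int_{0}^{2\cos\theta}
      \frac{\cos\theta}{r}\, r\,dr\,d\theta
  = \frac{2}{\pi}\int_{-\pi/2}^{\pi/2}\cos^{2}\theta\,d\theta
  = \frac{2}{\pi}\cdot\frac{\pi}{2}
  = 1 .
\]
```

So (4.2) holds with equality at every boundary point of the disc, as the comparison argument requires.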


5. PROOF OF EXISTENCE OF NICE LEAST-PERIMETER ENCLOSURES. In 
the Euclidean plane and in other special cases where all candidates can be 
convexified, the existence of a (convex) region of least perimeter and prescribed 
area follows from Blaschke’s selection theorem ([4, p. 38], [8, Chapt. 4]). A general 
smooth Riemannian surface S requires a more general argument. There must 
be some hypothesis to prevent the solution from disappearing to infinity as in 
Figure 5.1. Suppose for now that S is a compact surface, perhaps with convex 
boundary. 


Figure 5.1. In the surface of revolution generated by y = 1/x, for any given area, there is a sequence of 
annuli disappearing to infinity with perimeters going to 0. 


For the moment we restrict to images of the unit circle parametrized by 
arclength. Later we consider curves of several components. Then compactness 
properties of Lipschitz functions (Ascoli-Arzelà Theorem) immediately produce a
minimizer. The only problem is that in theory the limit might bump up against 
itself too wildly to permit the standard variational argument that it has constant 
geodesic curvature. The solution may actually bump up against itself, as in the 
cylinder of Figure 5.2. This technical difficulty delayed for 75 years the completion 
of Poincaré’s proof that every smooth convex sphere contains a simple closed
geodesic. In 1982 C. Croke [7] gave a complete proof by minimizing a combination 
of length and energy in a class of piecewise geodesic curves. 


Figure 5.2. Some least perimeter enclosures on the cylinder 
bump up against themselves. 


More recently, Hass and Morgan ([12]; see also [22, Lemma 2.2]) have provided
a very simple approach to more general existence and regularity using local
convexification. Away from the boundary of S, a minimizing enclosure is an
embedded curve of constant geodesic curvature $\kappa_0$, except possibly for finitely
many geodesic arcs or isolated points where it bumps up against itself but remains
$C^1$. Even at the boundary of S the curve remains $C^1$ and the geodesic curvature
satisfies $\kappa \le \kappa_0$ (weakly). If for bounded area the curve is allowed a large number
of components enclosing disjoint regions, no curve bumps itself on the inside and
$\kappa \le \kappa_0$ everywhere. If the curve is allowed several nested components enclosing
multiply connected regions, it never bumps itself. A word on the proof: if local
convexification causes two pieces of curve to cross, the longer one is rerouted
along the shorter. This process reduces length unless the curves were convex to
begin with. Given convexity, standard variational arguments prove the rest.

In many noncompact manifolds, such as the Euclidean or hyperbolic plane, one 
can work inside a large convex set. Hyperbolic surfaces and other surfaces can 
have thin cusps to infinity with nonconvex truncations, but as long as the area of 
the cusp is finite, any sequence of curves going off to infinity has area going to 0 
and may be discarded. 

Existence and some regularity hold as well for clusters in $\mathbf{R}^2$ (enclosing and
separating several regions of prescribed areas [21]) and in general dimensions by 
the techniques of geometric measure theory [19, Chapt. 13]. In higher dimensions 
you cannot hope to prescribe the topological type; for example, regions connected 
by thin tubes can disconnect in the limit. Even for curves in the plane, such general 
techniques do not have the topological control we need. 


6. CIRCULAR CYLINDERS. On the cylinder $\{x^2 + y^2 = a^2\} \subset \mathbf{R}^3$, the
least-perimeter enclosure of area A is a small (round) circle for $A \le 4\pi a^2$ and two
horizontal circles for $A \ge 4\pi a^2$.


Proof: We know that any solution consists of closed curves of constant curvature.
If one curve is homotopically trivial and hence is a small round circle, it is the only
one, or it could be translated to touch another and contradict regularity. If all the
curves are homotopically nontrivial, there must be at least two of them to enclose
area, and two horizontal circles are best. The transition occurs when the circumference
of the small circle $2\sqrt{\pi A}$ equals the length of two horizontal circles $4\pi a$, i.e.,
$A = 4\pi a^2$.
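
The comparison in the proof is simple enough to tabulate. The sketch below is an illustration only; the function name and the labels "disc" and "band" are hypothetical. It returns whichever of the two candidates is shorter for a given area A and cylinder radius a.

```python
import math

def cylinder_least_perimeter(area: float, a: float):
    """Shorter of the two candidates on the cylinder x^2 + y^2 = a^2.

    Returns a (label, perimeter) pair; labels are illustrative names.
    """
    disc = 2 * math.sqrt(math.pi * area)   # round circle bounding a flat disc
    band = 4 * math.pi * a                 # two horizontal circles
    return ("disc", disc) if disc <= band else ("band", band)

if __name__ == "__main__":
    a = 1.0
    for area in (1.0, 4 * math.pi * a * a, 50.0):
        print(area, cylinder_least_perimeter(area, a))
```

At the transition area $4\pi a^2$ the disc has radius $2a$, which is less than half the circumference $\pi a$ of the cylinder, so the round disc still fits without overlapping itself.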


7. FLAT TORI AND KLEIN BOTTLES (Howards [15, Thm. 3.1]). Let S be a flat 
torus or Klein bottle with shortest closed geodesic of length a. Given 0 < A < area S, 
the least-perimeter region of area A is 


(1) a circular disc if $0 < A \le a^2/\pi$;

(2) a band (possibly Möbius) with geodesic boundary if $a^2/\pi \le A \le \operatorname{area} S - a^2/\pi$;

(3) the complement of a circular disc if $\operatorname{area} S - a^2/\pi \le A < \operatorname{area} S$.


Proof: Any solution consists of closed curves of constant curvature. As in the 
argument in Section 6, if one is homotopically trivial and therefore a small round 
circle, it is the only one or it could be translated to touch another and contradict 
regularity. If all the components are nontrivial, for any given area the perimeter is 
uniquely minimized by a single geodesic band with perimeter 2a. The transitions 
between types occur when the circle has circumference 2a. 


Remark. The round sphere and round projective plane may be treated by similar 
arguments [15, Thms. 4.1, 5.1] or by the methods of Section 9 [22, Thms. 3.1, 3.3]. 


8. CIRCULAR CONES. On the circular cone $\{z = a\sqrt{x^2 + y^2}\} \subset \mathbf{R}^3$, the
least-perimeter enclosure of area A is a horizontal circle.




Figure 8.1. A constant-curvature curve about the vertex of the cone must be a circle of 
smaller circumference than a planar circle of the same area. 


Proof: If any component does not encircle the vertex, it must be the only component
(or it could be translated to touch another component, contradicting regularity),
and hence it must be a circle of length $2\sqrt{\pi A}$. Consider a constant-curvature
curve that encircles the vertex. It must be symmetric about the line through the
vertex and a point most distant from the vertex, as in Figure 8.1, so it must be a
horizontal circle. Clearly a single horizontal circle would have less perimeter than
several. Since one horizontal circle has length less than $2\sqrt{\pi A}$, it must be the
minimizer.


Remark. Actually for a simply-connected domain D on a surface with Gauss 
curvature K, the perimeter L satisfies 


$$L^2 \ge 4\pi A - 2A \int_D \max\{K, 0\},$$
with equality for the singular limit case of the cone [23, Thm. 4.3]. 
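
The equality case for the cone can be checked directly. The sketch below is an illustration with hypothetical function names; it assumes the standard facts that the horizontal circle of planar radius $\rho$ on the cone $z = a\sqrt{x^2 + y^2}$ has length $2\pi\rho$ and encloses area $\pi\rho^2\sqrt{1 + a^2}$, and that the Gauss curvature concentrated at the vertex is $2\pi(1 - 1/\sqrt{1 + a^2})$.

```python
import math

def cone_circle(rho: float, a: float):
    """Length and enclosed area of the horizontal circle of planar radius rho
    on the cone z = a*sqrt(x^2 + y^2)."""
    s = math.sqrt(1 + a * a)   # slant length per unit planar radius
    return 2 * math.pi * rho, math.pi * rho * rho * s

def cone_angle_deficit(a: float) -> float:
    """Total Gauss curvature at the vertex: 2*pi minus the cone angle 2*pi/s."""
    return 2 * math.pi * (1 - 1 / math.sqrt(1 + a * a))

if __name__ == "__main__":
    rho, a = 1.3, 0.75
    L, A = cone_circle(rho, a)
    bound = 4 * math.pi * A - 2 * A * cone_angle_deficit(a)
    print(L ** 2, bound)                    # the two numbers should agree
    print(L, 2 * math.sqrt(math.pi * A))    # cone circle vs. planar circle
```

The second printed pair illustrates the last step of the proof above: the horizontal circle is shorter than a planar circle of the same area.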


9. THE PARABOLOID AND OTHER SURFACES OF REVOLUTION. Sections 9 
and 10 provide some new examples. The following theorem and corollary include 
the paraboloid. The proof integrates the Gauss-Bonnet theorem. 


Theorem 9.1 ([24, Prop. 7], [22, Thm. 2.1], [29]). Consider the plane with smooth,
rotationally symmetric, complete metric such that the Gauss curvature is a strictly
decreasing function of the distance from the origin. Then the unique length-minimizing 
simple closed curve enclosing a given area is a circle centered at the origin. 


By Section 5, inside a surface of finite area or inside a large convex ball B, for
bounded area, there is a minimizer among $C^1$ curves of $m \le m_0$ components,
enclosing m disjoint discs. Away from $\partial B$, it is an embedded curve of constant
geodesic curvature $\kappa_0$, except possibly for finitely many geodesic arcs or isolated
points where it bumps up against itself. If $m_0$ is large, the curvature $\kappa \le \kappa_0$
everywhere.




A short proof of the following standard technical lemma is in [20, §9.7, p. 112] 
(cf. [22, Lemma 2.3]). The idea is that the rate of change of the perimeter 
is essentially the geodesic curvature, which is controlled by the Gauss-Bonnet 
theorem. 


Lemma 9.2. Let L(A) denote the least perimeter of $m \le m_0$ discs of total area A.
Then L(A) is differentiable almost everywhere and

$$L(A_0)^2 \ge \int_0^{A_0} (L^2)'.$$


Proof of Theorem 9.1 [22]: In the surface or inside a large convex ball, for $m_0$ large,
let L(A) denote the length of a shortest curve of $m \le m_0$ components enclosing
m disjoint discs of total area A. First we claim that if L is differentiable at A,
then $L'(A)$ is the geodesic curvature $\kappa_0$. One geometric interpretation of geodesic
curvature is the rate of change of length with respect to area under perturbations
of the given minimizer [20, Chapt. 2]. Hence for $\Delta A > 0$, the new minimizers must
do at least as well as perturbations of the old one, and $L'(A) \le \kappa_0$. Similarly for
$\Delta A < 0$, $-L'(A) \le -\kappa_0$. Therefore $L'(A) = \kappa_0$.

Now Gauss-Bonnet tells us that the total Gauss curvature of the enclosed region 
equals 


$$2\pi m - \int \kappa \ge 2\pi - L(A)\kappa_0 = 2\pi - L(A)L'(A).$$


Let G(A) denote the total Gauss curvature of a disc of area A centered at the 
origin. Since the Gauss curvature is a decreasing function of radius, any other 
region with the same area must have less total Gauss curvature. So we have 


$$2\pi - L(A)L'(A) \le G(A), \qquad (9.1)$$

and

$$L(A)L'(A) \ge 2\pi - G(A).$$

By Lemma 9.2, integration from $A = 0$ to $A_0$ yields

$$L(A_0)^2 \ge 2\int L(A)L'(A) \ge 4\pi A_0 - 2\int G(A). \qquad (9.2)$$


This inequality is sharp for a circle centered at the origin, as we can see by
integrating the Gauss-Bonnet formula for circles centered at the origin with area
A from 0 to $A_0$. Hence equality holds in (9.2), L is differentiable everywhere, and
equality holds in (9.1). Therefore a minimizer encloses Gauss curvature G(A) and
must be a circle about the origin.
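
For the paraboloid of Theorem 1.1 the sharpness of (9.2) can be confirmed numerically. The sketch below is illustrative only and uses hypothetical function names. It assumes the standard formulas for the surface $z = \rho^2$ in polar coordinates: with $w = \sqrt{1 + 4\rho^2}$, the horizontal circle has length $2\pi\rho$, the cap below it has area $(\pi/6)(w^3 - 1)$, the Gauss curvature is $4/w^4$, and the total curvature of the cap is $G = 2\pi(1 - 1/w)$.

```python
import math

def w(rho: float) -> float:
    """sqrt(1 + |grad z|^2) for the paraboloid z = rho^2."""
    return math.sqrt(1 + 4 * rho * rho)

def length(rho: float) -> float:
    """Length of the horizontal circle of planar radius rho."""
    return 2 * math.pi * rho

def area(rho: float) -> float:
    """Area of the cap of the paraboloid below that circle."""
    return math.pi / 6 * (w(rho) ** 3 - 1)

def gauss_curvature(rho: float) -> float:
    """Gauss curvature at planar radius rho (decreasing in rho)."""
    return 4 / w(rho) ** 4

def total_curvature(rho: float) -> float:
    """G: the integral of the Gauss curvature over the cap."""
    return 2 * math.pi * (1 - 1 / w(rho))

def integral_of_G(rho0: float, steps: int = 20000) -> float:
    """Midpoint rule for the integral of G dA, where dA = 2*pi*rho*w(rho) drho."""
    h = rho0 / steps
    total = 0.0
    for i in range(steps):
        rho = (i + 0.5) * h
        total += total_curvature(rho) * 2 * math.pi * rho * w(rho) * h
    return total

if __name__ == "__main__":
    rho0 = 1.5
    lhs = length(rho0) ** 2
    rhs = 4 * math.pi * area(rho0) - 2 * integral_of_G(rho0)
    print(lhs, rhs)   # the two sides of (9.2) for the horizontal circle
```

The two printed values agree up to the quadrature error, which is what equality in (9.2) for circles about the origin predicts.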


The following general extension to several, perhaps multiply connected regions 
is deduced in [22, Thm. 3.1]. Here we give a proof for the easy case of positive 
Gauss curvature, which includes the paraboloid. 


Corollary 9.3. Among unions of disjoint, perhaps multiply connected regions, a 
perimeter minimizer exists and is a (round) disc, disc complement, or annulus about 
the origin. If the Gauss curvature is positive or the total Gauss curvature of every 
compact region is less than 27, then the minimizer is a disc. 


Proof for the case of positive Gauss curvature: By the Gauss-Bonnet theorem, the
perimeter $P(r)$ and geodesic curvature $\kappa(r)$ of a circle about the origin of radius r
bounding a disc D satisfy

$$P' = \kappa P = 2\pi - \int_D K. \qquad (9.3)$$

The total Gauss curvature is at most $2\pi$, since otherwise eventually $P' \le -a < 0$
and P hits 0. By (9.3), $\kappa$ is positive and decreasing. Consider any collection of
simple closed curves enclosing area $A_0$. By discarding any curves inside others,
enclose area $A_1 \ge A_0$. By Theorem 9.1, each curve alone would be shortest if a
circle about the origin. Since $\kappa = dP/dA$ is decreasing, one single circle about the
origin is best. Since $A_0 \le A_1$, the circle of area $A_0$ is best of all.

10. HYPERBOLIC MANIFOLDS. We consider geometrically finite complete hyperbolic
surfaces (curvature $-1$). Such surfaces may be compact or have finitely
many ends: cusps (with exponentially shrinking thickness and finite area) or flared 
ends (asymptotic to the hyperbolic plane). 


Theorem 10.1 [1, Thm. 2.2]. Let S be a hyperbolic surface. For given area $0 < A <$
area(S), a perimeter-minimizing system of embedded rectifiable curves bounding a
region of area A consists of curves of the same constant curvature of one of four types:


(I) a circle,
(II) horocycles around cusps,
(III) two “neighboring curves” at constant distance from a geodesic, bounding an
annulus or complement,
(IV) geodesics or single neighboring curves.


The total perimeter L satisfies


$$L \le \sqrt{A^2 + 4\pi A}, \qquad (10.1)$$


with equality for a circle of area A. If S has at least one cusp, then cases (I) and (III)
do not occur and $L \le A$; if moreover $A < \pi$, then a minimizer consists of horocycles
bounding neighborhoods of an arbitrary collection of cusps and has perimeter $L = A$.
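
The equality case of (10.1) can be verified by a short computation, added here for illustration. It assumes the standard formulas for a circle of radius r in the hyperbolic plane of curvature $-1$, namely $L = 2\pi\sinh r$ and $A = 2\pi(\cosh r - 1)$:

```latex
\[
L^{2} = 4\pi^{2}\sinh^{2} r
      = 4\pi^{2}(\cosh r - 1)(\cosh r + 1)
      = A\,(A + 4\pi)
      = A^{2} + 4\pi A .
\]
```

For comparison, the Euclidean circle satisfies $L^2 = 4\pi A$, so the extra term $A^2$ measures the effect of curvature $-1$.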


Proof sketch: The constant-curvature curves on a hyperbolic surface are circles
bounding discs ($\kappa > 1$) or the complement ($\kappa < -1$), horocycles around cusps
($\kappa = 1$) or the complement ($\kappa = -1$), and constant-curvature curves around necks
($|\kappa| < 1$, including the geodesics around the middle of necks with $\kappa = 0$).

A minimizer cannot have more than one circle, since sliding one until it hits
another (or itself) would contradict regularity. Since for other types, $dL/dA = \kappa$ is
less than it is for a circle, (10.1) always holds, and there is an $A_0 \ge 0$ such that if
$A < A_0$ the minimizer is a circle, while if $A > A_0$, it is not a circle and (for
$\Delta A > 0$)


$$\Delta L/\Delta A \le 1. \qquad (10.2)$$


Now a computation shows that an annulus (or complement) as in Case (III) must 
occur alone, or an operation such as discarding it would contradict (10.2). There- 
fore the minimizer falls into one of the four asserted cases. 

Henceforth assume S has a cusp. Case (I) cannot occur, because sliding the
circle out the cusp until it hits itself would contradict regularity. Hence the
minimizer always has $|\kappa| \le 1$, and always $L(A) \le A$. A computation shows that
Case (III) cannot occur.




Finally assume $A < \pi$. We claim there is no minimizer with $-1 \le \kappa < 1$ and
length $L \le A$, so $-A + \kappa L < 0$. Otherwise, applying Gauss-Bonnet to the
enclosed region yields


$$2\pi\chi = -A + \kappa L < 0,$$

$\chi \le -1$, $-A + \kappa L \le -2\pi$, $\kappa L \le -\pi$, $\kappa < 0$, $L \ge \pi > A$, a contradiction.


The remaining possibilities, systems of curves with $\kappa = 1$, consist of horocycles
bounding cusp neighborhoods. Since $\kappa = 1$, as you slide a horocycle out a cusp
$dL/dA = 1$, and therefore its length equals the area of the cusp neighborhood. By
the claim, such systems remain minimizing as long as they exist, either for all
$A < \pi$ or until they bump up against themselves at some $A_1 < \pi$. If one bumps,
by regularity the minimizer has perimeter less than $A_1$, contradicting the claim and
proving the theorem.


ACKNOWLEDGMENT. Morgan’s work was partially supported by a National Science Foundation 
grant. 


REFERENCES 


1. Colin Adams and Frank Morgan, Isoperimetric curves on hyperbolic surfaces, Proc. Amer. Math.
Soc., to appear.
2. Itai Benjamini and Jianguo Cao, A new isoperimetric theorem for surfaces of variable curvature,
Duke Math. J. 85 (1996) 359-396.
3. M. Berger, Geometry II, Springer-Verlag, New York, 1977.
4. W. Blaschke, Vorlesungen über Differentialgeometrie II: Affine Differentialgeometrie, Berlin, 1923.
5. T. Bonnesen and W. Fenchel, Theory of Convex Bodies (English translation of Theorie der Konvexen
Körper), BCS Associates, Moscow, Idaho, 1987.
6. R. Courant and D. Hilbert, Methods of Mathematical Physics, Vol. 1, Wiley & Sons, NY, 1953.
7. C. Croke, Poincaré’s problem on the shortest closed geodesic on a convex hypersurface,
J. Differential Geom. 17 (1982) 595-634.
8. Harold Gordon Eggleston, Convexity, Cambridge University Press, Cambridge, 1966.
9. Joel Foisy, Soap bubble clusters in $\mathbf{R}^2$ and $\mathbf{R}^3$, undergraduate thesis, Williams College, 1991.
10. M. Gromov, Isoperimetric inequalities in Riemannian manifolds, Appendix I to Vitali D. Milman 
and Gideon Schechtman, Asymptotic Theory of Finite Dimensional Normed Spaces, Lecture Notes 
in Mathematics, No. 1200, Springer-Verlag, New York, 1986. 
11. Joel Hass, Michael Hutchings, and Roger Schlafly, The double bubble conjecture, Electron. Res. 
Announc. Amer. Math. Soc. 1 (1995) 98-102. 
12. Joel Hass and Frank Morgan, Geodesics and soap bubbles in surfaces, Math. Z. 223 (1996) 
185-196. 
13. Joel Hass and Roger Schlafly, Bubbles and double bubbles, American Scientist, Sept.—Oct., 1996, 
pp. 462-467. 
14. Joel Hass and Roger Schlafly, Double bubbles minimize, Ann. of Math., to appear. 
15. Hugh Howards, Soap bubbles on surfaces, undergraduate thesis, Williams College, 1992. 
16. Michael Hutchings, The structure of area-minimizing double bubbles, J. Geom. Anal. 7 (1997) 
285-304. 
17. Frank Morgan, Clusters minimizing area plus length of singular curves, Math. Ann. 299 (1994) 
697-714. 
18. Frank Morgan, The Double Bubble Conjecture, FOCUS, Math. Assn. Amer., December, 1995. 
19. Frank Morgan, Geometric Measure Theory: a Beginner’s Guide, Academic Press, second edition, 
Boston, 1995. 
20. Frank Morgan, Riemannian Geometry: a Beginner’s Guide, A. K. Peters, Ltd., second edition, 
Natick, 1998. 
21. Frank Morgan, Soap bubbles in $\mathbf{R}^2$ and in surfaces, Pacific J. Math. 165 (1994) 347-361.
22. Frank Morgan, Michael Hutchings, and Hugh Howards, The isoperimetric problem on surfaces of 
revolution of decreasing Gauss curvature, Trans. Amer. Math. Soc., to appear. 
23. Robert Osserman, The isoperimetric inequality, Bull. Amer. Math. Soc. 84 (1978) 1182-1238. 
24. Pierre Pansu, Sur la régularité du profil isopérimétrique des surfaces riemanniennes compactes, 
Ann. Inst. Fourier 48 (1998) 247-264. 




25. Thomas Isaac Porter, A history of the classical isoperimetric problem, in Contributions to the 
Calculus of Variations 1931-1932, Univ. of Chicago Press, Chicago, 1933, pp. 475-520. 

26. Manuel Ritoré, Constant geodesic curvature curves and isoperimetric domains in rotationally 
symmetric surfaces, preprint (1998). 

27. J. Steiner, Einfache Beweise der isoperimetrischen Hauptsätze, Crelle’s J. 18 (1838) 286-287.

28. J. Steiner, Sur le maximum et minimum des figures dans le plan, sur le sphere, et espace en 
general, Crelle’s J. 24 (1842) 93-152, 189-250 (figures separate at end of fascicle). 

29. Peter Topping, The isoperimetric inequality on a surface, preprint (1997). 

30. Peter Topping, Mean curvature flow and geometric inequalities, J. Reine Angew. Math. 503 (1998) 
47-61. 


The three authors all spent time at Williams College, where Howards and Hutchings participated in 
Morgan’s NSF undergraduate research Geometry Group and Howards wrote his undergraduate thesis. 
Howards, who went to Williams and UC San Diego, is assistant professor of Mathematics at Wake 
Forest University. Hutchings, who went to Harvard, is Szegő Assistant Professor of Mathematics at
Stanford. Morgan, who went to MIT and Princeton, is Meenan Third Century Professor of Mathemat- 
ics at Williams College. In January, 1993, he received one of the first MAA national awards for 
distinguished teaching. 
Morgan has a biweekly Math Chat column and a weekly live call-in Math Chat on local cable TV,
both available at the MAA web site at
www.maa.org.


Wake Forest University, Winston-Salem, NC 27109 
howards@wfu.edu 


Stanford University, Stanford, CA 94305 
hutching@math.stanford.edu 


Williams College, Williamstown, MA 01267 
Frank.Morgan@williams.edu


The infinitude of the primes 

Is the subject of plenty of rhymes, 
But we can’t begin 
To prove there’s a twin 

An infinite number of times. 


Contributed by Peter Rosenthal, University of Toronto 




What Is a Closed-Form Number? 


Timothy Y. Chow


1. INTRODUCTION. When I was a high-school student, I liked giving exact 
answers to numerical problems whenever possible. If the answer to a problem were 
$2/7$ or $\pi\sqrt{5}$ or $\arctan 3$ or $e^{1/e}$, I would always leave it in that form instead of
giving a decimal approximation. 

Certain problems frustrated me because there did not seem to be any way to 
express their solutions exactly. For example, consider the following problems. 


Question 1. The equation 
$$x + e^x = 0 \qquad (1.1)$$


has exactly one real root; call it R. Is there a closed-form expression for R? 


Question 2. The equation 
$$2x^5 - 10x + 5 = 0 \qquad (1.2)$$


has five distinct roots $r_1$, $r_2$, $r_3$, $r_4$, and $r_5$. Are there closed-form expressions for
them? 


Questions like this seemed to have a negative answer, but I continued hoping that 
the answer was yes, and that I just did not know enough mathematics yet. 

In college I learned about Galois theory, and that the Galois group of (1.2) is $S_5$
[7, Section 5.8]. So the $r_i$ are provably not expressible in terms of radicals. But
although this probably should have satisfied me, it did not. Consider the equation 


$$x^4 - (6\sqrt{3})x^3 + 8x^2 + (2\sqrt{3})x - 1 = 0.$$


Its roots are $\tan(\pi/15)$, $\tan(4\pi/15)$, $\tan(7\pi/15)$, and $\tan(13\pi/15)$. These seemed
to me to be perfectly good closed-form expressions. Although in this particular 
case the roots could also be expressed in terms of radicals, it seemed to me that 
there might exist algebraic numbers that were not expressible using radicals but 
that could still be expressed in closed form—say, using trigonometric or exponen- 
tial or logarithmic functions. So as far as I was concerned, Galois theory was not 
the end of the story. 
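
The claim about the roots of the quartic is easy to confirm numerically. The following sketch is an illustration, not part of the original text; the function name is hypothetical. It evaluates the polynomial at the four tangent values and also checks the sum of the roots against the coefficient of $x^3$.

```python
import math

def quartic(x: float) -> float:
    """x^4 - 6*sqrt(3)*x^3 + 8*x^2 + 2*sqrt(3)*x - 1, by Horner's rule."""
    r3 = math.sqrt(3)
    return (((x - 6 * r3) * x + 8) * x + 2 * r3) * x - 1

# The four claimed roots tan(k*pi/15) for k = 1, 4, 7, 13.
roots = [math.tan(k * math.pi / 15) for k in (1, 4, 7, 13)]

if __name__ == "__main__":
    for r in roots:
        print(r, quartic(r))                 # residuals should be near zero
    print(sum(roots), 6 * math.sqrt(3))      # Vieta: the roots sum to 6*sqrt(3)
```

One way to see where the quartic comes from: the five angles $\pi/15 + k\pi/5$ all satisfy $\tan 5\theta = \sqrt{3}$, which gives a quintic in $\tan\theta$, and dividing out the root $\tan(10\pi/15) = -\sqrt{3}$ leaves the displayed equation.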

When students ask for a closed-form expression for $\int \exp(x^2)\,dx$, we all know
the standard answer: the given function is not an elementary function. Curiously, 
though, Question 1 (as well as Question 2, if you accept my dissatisfaction with the 
Galois-theoretic answer) does not seem to have a standard answer that “everybody 
knows.” At most we might mutter vaguely that (1.1) is a “transcendental equation,” 
but this is not very helpful. 

This nonexistence of a standard answer to such a simple and common question 
seems almost scandalous to me. The main purpose of this paper is to eliminate this 
scandal by suggesting a precise definition of a “closed-form expression for a 
number.” This will enable us to restate Questions 1 and 2 precisely, and will let us 


440 WHAT IS A CLOSED-FORM NUMBER? [May 


see how they are related to existing work in logic, computer algebra, and transcen- 
dental number theory. My hope is that this definition of a closed-form expression 
for a number will become standard, and that many readers will be lured into 
working on the many attractive open problems in this area. 


2. FROM ELEMENTARY FUNCTIONS TO EL NUMBERS. How can we make 
Questions 1 and 2 precise? Our first inclination might be to turn to the notion of 
an elementary function. Recall that a function is elementary if it can be constructed 
using only a finite combination of constant functions, field operations, and alge- 
braic, exponential, and logarithmic functions. This class of functions has been 
studied a great deal in connection with the problem of symbolic integration or 
“integration in finite terms” [4], and it does a rather good job of capturing 
“high-school intuitions” about what a closed-form expression should look like. For 
example, in Question 1 above, it turns out that R= —W(1), where W, the 
Lambert W function [6], is the (multivalued) function defined by the equation 


$$W(x)e^{W(x)} = x.$$


But since W is not an elementary function [5], this is not an answer that would 
satisfy most high-school students. Similarly, if we allow various special 
functions—e.g., elliptic, hypergeometric, or theta functions—then we can explicitly 
express the $r_i$ in Question 2, or indeed the roots of any polynomial equation, in
terms of the coefficients; see [3] and [10]. But this again feels unsatisfactory 
because these special functions are not elementary. 
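
As a numerical sanity check on the relation between R and the Lambert W function, the following sketch solves x + e^x = 0 by Newton's method and confirms that w = -R satisfies w e^w = 1. The function name `real_root`, the starting point, and the tolerance are illustrative choices, not part of the paper.

```python
import math

def real_root(x0: float = -0.5, tol: float = 1e-15, max_iter: int = 50) -> float:
    """Newton's method for g(x) = x + exp(x), which is strictly increasing."""
    x = x0
    for _ in range(max_iter):
        step = (x + math.exp(x)) / (1.0 + math.exp(x))
        x -= step
        if abs(step) < tol:
            break
    return x

R = real_root()
w = -R  # candidate value of W(1) on the real branch
print(R)                # approximately -0.5671432904
print(w * math.exp(w))  # approximately 1, so w satisfies W(1) * exp(W(1)) = 1
```

A numerical value of course says nothing about whether R has a closed form; it only pins down which number the question is about.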

The concept of an elementary function is certainly on the right track, but 
observe that what we need for Questions 1 and 2 is a notion of a closed-form 
number rather than a closed-form function. The distinction is important; we 
cannot, for example, simply define an “elementary number” to be any number 
obtainable by evaluating an elementary function at a point, because all constant 
functions are elementary, and this definition would make all numbers elementary. 
Furthermore, even if a function (such as W) is not elementary, it is conceivable 
that each particular value that it takes (W(1), W(2),...) could have an elementary 
expression, but with different-looking expressions at different points. These diffi- 
culties can probably be circumvented with a little work, but we take a different 
tack; instead of trying to define closed-form numbers in terms of elementary 
functions, we give an analogous definition. 

We mention one more technical point. By convention, all algebraic functions 
(i.e., functions that satisfy a polynomial equation with polynomial coefficients) are 
considered to be elementary, but this is not suitable for our purposes. Intuitively, 
“closed-form” implies “explicit,” and most algebraic functions have no simple 
explicit expression. So the set of purely transcendental elementary functions is a 
better prototype for our purposes than the set of elementary functions. “Purely 
transcendental” simply means that the word “algebraic” is dropped from the 
definition.

With all these considerations in mind, we propose the following fundamental 
definition. 


Definition. A subfield F of C is closed under exp and log if (1) $\exp(x) \in F$ for all
$x \in F$ and (2) $\log(x) \in F$ for all nonzero $x \in F$, where log is the branch of the
natural logarithm function such that $-\pi < \operatorname{Im}(\log x) \le \pi$ for all x. The field E
of EL numbers is the intersection of all subfields of C that are closed under exp
and log.


Before discussing E, let us make some remarks about terminology. It might seem 
more natural to call E the field of elementary numbers, but unfortunately this term 
is already taken. It seems to have been first used by Ritt [18, p. 60]. By analogy 
with elementary functions, Ritt thought of elementary numbers as the smallest 
algebraically closed subfield L of C that is closed under exp and log. It so happens 
that terminology has evolved since Ritt, so that “elementary numbers” are now 
numbers that can be specified implicitly as well as explicitly by exponential, 
logarithmic, and algebraic operations, and L is now sometimes called the field of 
Liouvillian numbers [16]. But either way, calling E the field of elementary numbers 
would conflict with existing usage. The “EL” in the term “EL number” is intended 
to be an abbreviation for “Exponential-Logarithmic” as well as a diminutive of 
“Elementary”; it reminds us that E is a subfield of the elementary numbers. 

I should also remark that I am certainly not the first person ever to have 
considered the field E, but it has received surprisingly little attention in the 
literature and nobody seems to have lobbied for it as a fundamental object of 
interest, which in my opinion it is (as illustrated by my temerity in using “black- 
board bold” for it). 

Let us do a few warmup exercises to familiarize ourselves with E. We can 
construct E as follows. Set $E_0 = \{0\}$, and for each $n > 0$ let $E_n$ be the set of all
complex numbers obtained either by applying a field operation to any pair of (not
necessarily distinct) elements of $E_{n-1}$ or by applying exp or log to any element of
$E_{n-1}$; of course, division by zero and taking the logarithm of zero are forbidden.
Then it is clear that E is the union of all the $E_n$. This shows in particular that E is
countable, and that every element of E admits an explicit finite expression in terms 
of rational numbers, field operations, exp, and log. 
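
The construction can be made concrete with a small floating-point sketch that generates the first few levels. It is only an approximation of the exact sets: elements are compared after rounding to 12 decimal places, and the helper names `normalize` and `next_level` are hypothetical.

```python
import cmath
from itertools import product

def normalize(z: complex, digits: int = 12) -> complex:
    """Round so floating-point duplicates collapse; adding 0.0 turns -0.0 into 0.0."""
    return complex(round(z.real, digits) + 0.0, round(z.imag, digits) + 0.0)

def next_level(prev: set) -> set:
    """One step of the construction: field operations on pairs, then exp and log."""
    out = set()
    for a, b in product(prev, repeat=2):
        out.update(normalize(v) for v in (a + b, a - b, a * b))
        if b != 0:  # division by zero is forbidden
            out.add(normalize(a / b))
    for a in prev:
        out.add(normalize(cmath.exp(a)))
        if a != 0:  # log of zero is forbidden
            out.add(normalize(cmath.log(a)))  # principal branch, Im in (-pi, pi]
    return out

levels = [{0j}]  # level 0 contains only 0
for _ in range(3):
    levels.append(next_level(levels[-1]))

for n, level in enumerate(levels):
    print(n, len(level))
```

Under these assumptions the first three levels are {0}, {0, 1}, and {-1, 0, 1, 2, e}; the number pi*i first appears at level 3 as log(-1).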

Many familiar constants lie in E, e.g.,

$$e = \exp(\exp(0)), \qquad i = \exp\!\left(\frac{\log(-1)}{2}\right), \qquad \text{and} \qquad \pi = -i \log(-1).$$

Since $2\pi i \in E$, we actually have access to all branches of the logarithm and not
just the principal one, so all n of the nth roots of any $x \in E$ are also in E. It follows
that all the roots of any polynomial equation with rational coefficients that is
solvable in radicals lie in E. Finally, formulas such as

$$x^{2/3} = \exp\!\left(\frac{2 \log x}{3}\right), \qquad \sin x = \frac{\exp(ix) - \exp(-ix)}{2i},$$

$$\tanh x = \frac{\exp(x) - \exp(-x)}{\exp(x) + \exp(-x)}, \qquad \text{and} \qquad \arccos x = -i \log\!\left(x + \exp\!\left(\frac{\log(x^2 - 1)}{2}\right)\right)$$

show that any expression involving “high-school” functions and elements of E is
also in E.
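
These identities are easy to spot-check numerically. The sketch below evaluates each one with Python's `cmath`, whose `log` is the principal branch used in the definition of E; the variable and function names are illustrative only.

```python
import cmath
import math

# Constants built from 0 and -1 using only exp, log, and field operations.
e_val = cmath.exp(cmath.exp(0))
i_val = cmath.exp(cmath.log(-1) / 2)
pi_val = -i_val * cmath.log(-1)

def two_thirds_power(x: complex) -> complex:
    return cmath.exp(2 * cmath.log(x) / 3)

def sin_el(x: complex) -> complex:
    return (cmath.exp(i_val * x) - cmath.exp(-i_val * x)) / (2 * i_val)

def arccos_el(x: complex) -> complex:
    sqrt_term = cmath.exp(cmath.log(x * x - 1) / 2)  # a square root of x^2 - 1
    return -i_val * cmath.log(x + sqrt_term)

print(e_val, i_val, pi_val)
print(two_thirds_power(8))                 # approximately 4
print(sin_el(1.0), math.sin(1.0))          # should agree up to rounding
print(arccos_el(0.3), math.acos(0.3))      # should agree up to rounding
```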
We hope that this brief discussion has persuaded the reader that E is the “right”
precise definition of “the set of all complex numbers that can be written in closed
form.” Accepting this, we can reformulate Questions 1 and 2 as follows.


Conjecture 1. The real root R of $x + e^x = 0$ is not in E.

Conjecture 2. The roots $r_1$, $r_2$, $r_3$, $r_4$, and $r_5$ of $2x^5 - 10x + 5 = 0$ are not in E.


As far as I know, Conjecture 1 and Conjecture 2 are, perhaps surprisingly, still
open. Thus we are still frustrated, but at least our frustration has been raised to a
higher plane. The next section of this paper is devoted to partial results.


3. SCHANUEL’S CONJECTURE. Conjecture 1 is essentially due to Ritt, except 
that he asked the question with L instead of E, since he was motivated by
considerations different from ours. The best partial result I am aware of is due to 
Ferng-Ching Lin [12]. To state Lin’s theorem, we must first recall Schanuel’s 
conjecture. 


Schanuel’s Conjecture. If $\alpha_1, \alpha_2, \ldots, \alpha_n$ are complex numbers linearly independent
over Q, then the transcendence degree of the field $Q(\alpha_1, e^{\alpha_1}, \alpha_2, e^{\alpha_2}, \ldots, \alpha_n, e^{\alpha_n})$
over Q is at least n.


Schanuel’s conjecture implies many famous theorems and conjectures about 
transcendental numbers. For example, it implies the Lindemann-Weierstrass theorem:
If $\alpha_1, \alpha_2, \ldots, \alpha_n$ are algebraic numbers that are linearly independent over Q,
then $e^{\alpha_1}, e^{\alpha_2}, \ldots, e^{\alpha_n}$ are algebraically independent over Q [2, Theorem 1.4].
Schanuel’s conjecture also implies the Gelfond-Schneider theorem: If $\alpha_1$ and $\alpha_2$
are algebraic numbers for which there exist Q-linearly independent numbers $\beta_1$
and $\beta_2$ such that $\alpha_1 = e^{\beta_1}$ and $\alpha_2 = e^{\beta_2}$, then $\beta_1$ and $\beta_2$ are linearly
independent over the algebraic numbers. Baker’s generalization [2, Theorem 2.1] of
Gelfond-Schneider to an arbitrarily large finite number of $\alpha_i$ also follows from
Schanuel’s conjecture. It is an easy exercise (using $e^{\pi i} = -1$) to show that
Schanuel’s conjecture implies that e and $\pi$ are algebraically independent, which is
currently not known; it is not even known that $e + \pi$ is transcendental. A proof of
Schanuel’s conjecture would be big news, although at present it seems to be out 
of reach. 
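
One way to carry out the exercise about e and $\pi$ is sketched here; this is a reader's reconstruction, not an argument quoted from the paper. Apply Schanuel's conjecture with n = 2 to $\alpha_1 = 1$ and $\alpha_2 = \pi i$, which are linearly independent over Q because $\pi i$ is not real:

```latex
\operatorname{trdeg}_{\mathbb{Q}} \mathbb{Q}\bigl(1,\, e^{1},\, \pi i,\, e^{\pi i}\bigr)
  = \operatorname{trdeg}_{\mathbb{Q}} \mathbb{Q}(e,\, \pi i) \;\ge\; 2 .
```

The equality holds because $e^{\pi i} = -1$ and 1 are rational. A field generated by two elements has transcendence degree at most 2, so e and $\pi i$ are algebraically independent over Q; since i is algebraic, the same is true of e and $\pi$.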

Let $\overline{Q}$ denote the algebraic closure of Q. Then Lin’s result is the following.


Theorem 1. If Schanuel’s conjecture is true and $f(x, y) \in \overline{Q}[x, y]$ is an irreducible
polynomial involving both x and y and $f(\alpha, \exp(\alpha)) = 0$ for some nonzero $\alpha \in C$,
then $\alpha \notin L$.


By taking $f(x, y) = x + y$ and noting that $E \subseteq L$, we see at once that Schanuel’s
conjecture implies Conjecture 1. 

Conjecture 2 seems to be new. It is folklore that general polynomial equations 
(i.e., those with variable coefficients) cannot be solved in terms of the exponential 
and logarithmic functions, although nobody seems to have written down a com- 
plete proof; partial proofs may be found in [9, paragraph 513] and [1, p. 114]. The 
inexpressibility of an algebraic function in terms of exp and log does not, however, 
imply that particular values of an algebraic function cannot be expressed in terms 
of exp and log, just as some quintic equations with rational coefficients are solvable 
in radicals even though the general quintic is not. 

The remainder of this section is devoted to proving the following result. 


Theorem 2. Schanuel’s conjecture implies Conjecture 1 and Conjecture 2. 


As we just remarked, Lin has already shown that Schanuel’s conjecture implies 
Conjecture 1, but we shall exploit the fact that Conjecture 1 is weaker than the 
conclusion of Theorem 1 to give a shorter proof. The proof we offer for Theorem 2 
is joint work with Daniel Richardson. The reader may check that our arguments 
generalize readily to other transcendental equations such as x = cos x. The reader 
may also verify that the only property of (1.2) we use is the unsolvability of its 
Galois group. We therefore obtain the following corollary of the proof.


Corollary 1. If Schanuel’s conjecture is true, then the algebraic numbers in E are
precisely the roots of polynomial equations with integer coefficients that are solvable 
in radicals. 


Thus, Schanuel’s conjecture implies that our notion of a “closed-form algebraic 
number” coincides with the usual one. 

Although Conjecture 1 and Conjecture 2 involve quite different kinds of 
equations, it turns out that there is a single concept (a reduced tower) that is the 
key to both. 

We need some preliminaries. If $A = (\alpha_1, \alpha_2, \ldots, \alpha_n)$ is a finite sequence
of complex numbers, then for brevity we write $A_i$ for the field
$Q(\alpha_1, e^{\alpha_1}, \alpha_2, e^{\alpha_2}, \ldots, \alpha_i, e^{\alpha_i})$. In particular, $A_0 = Q$.


Definition. A tower is a finite sequence $A = (\alpha_1, \alpha_2, \ldots, \alpha_n)$ of nonzero complex
numbers such that for all $i \in \{1, 2, \ldots, n\}$, there exists some integer $m_i > 0$ such
that $\alpha_i^{m_i} \in A_{i-1}$ or $e^{\alpha_i m_i} \in A_{i-1}$ (or both). A tower is reduced if the set $\{\alpha_i\}$ is
linearly independent over Q. If $\beta \in C$, then a tower for $\beta$ is a tower
$A = (\alpha_1, \alpha_2, \ldots, \alpha_n)$ such that $\beta \in A_n$.


For any $\gamma \in E$, there exists a tower for $\gamma$. This is best explained by example.
Suppose $\gamma = 4 + \log(1 + e^{(\log 2)/3})$. Then we may take


$$A = (\alpha_1, \alpha_2, \alpha_3) = \bigl(\log 2,\ (\log 2)/3,\ \log(1 + e^{(\log 2)/3})\bigr).$$


We can then take $m_i = 1$ for all i, because $e^{\alpha_1} = 2 \in A_0$, $\alpha_2 \in A_1$, and $e^{\alpha_3} \in A_2$.
In general, we build up the expression for $\gamma$ step by step, and if at step i we need
to take the exponential of some number $\beta \in A_{i-1}$ we simply set $\alpha_i = \beta$; if we
need to take the logarithm of some $\beta \in A_{i-1}$ then we set $\alpha_i = \log \beta$. With this
construction, we never need to take $m_i > 1$, but the tower we obtain may not be
reduced (as is the case in this example: $\alpha_1 - 3\alpha_2 = 0$). In order to be able to use
Schanuel’s conjecture, however, we need reduced towers, so our first goal is to 
show how to reduce a given tower. 


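Division Lemma. Suppose $A = (\alpha_1, \alpha_2, \ldots, \alpha_n)$ is a tower and $q_1, q_2, \ldots, q_n$ are
nonzero integers. Then the sequence $B = (\beta_1, \beta_2, \ldots, \beta_n)$ defined by $\beta_i = \alpha_i/q_i$ is
also a tower, and $A_i \subseteq B_i$ for all i.


Proof: Given any $i \in \{1, 2, \ldots, n\}$, note that every $\gamma \in A_i$ is a rational function
(with rational coefficients) of the numbers $\alpha_1, e^{\alpha_1}, \ldots, \alpha_i, e^{\alpha_i}$. Now
$\alpha_j = (\alpha_j/q_j) q_j = \beta_j q_j$ and $e^{\alpha_j} = e^{(\alpha_j/q_j) q_j} = (e^{\beta_j})^{q_j}$ for all j, so $\gamma$ is also a
rational function with rational coefficients of the numbers $\beta_1, e^{\beta_1}, \ldots, \beta_i, e^{\beta_i}$,
and hence $\gamma \in B_i$. So $A_i \subseteq B_i$ for all i.


Given any $i \in \{1, 2, \ldots, n\}$, there is some integer $m_i > 0$ such that $\alpha_i^{m_i} \in A_{i-1}$
or $e^{\alpha_i m_i} \in A_{i-1}$. Consider first the case in which $\alpha_i^{m_i} \in A_{i-1}$. Then


$$\beta_i^{m_i} = \frac{\alpha_i^{m_i}}{q_i^{m_i}} \in A_{i-1} \subseteq B_{i-1}.$$


If on the other hand $e^{\alpha_i m_i} \in A_{i-1}$, then $e^{\beta_i (q_i m_i)} = e^{\alpha_i m_i} \in A_{i-1} \subseteq B_{i-1}$. Hence
there is a positive integer $m'_i$ (for example, $m'_i = |q_i| m_i$, which equals $q_i m_i$ when
$q_i > 0$; when $q_i < 0$ we use that $B_{i-1}$ is closed under inverses) such that
$\beta_i^{m'_i} \in B_{i-1}$ or $e^{\beta_i m'_i} \in B_{i-1}$. So B is a tower. ∎
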

Reduction Lemma. For any $\gamma \in E$, there exists a reduced tower for $\gamma$.


Proof: If $\gamma \in Q$ then we may take A to be the empty sequence. Otherwise,
suppose that every tower for $\gamma$ is not reduced; we shall derive a contradiction.
Choose a tower A for $\gamma$ with n minimal; since $\gamma \notin Q$, $n \ge 1$. Let i be the smallest
integer such that $\{\alpha_1, \alpha_2, \ldots, \alpha_i\}$ is linearly dependent. Then


$$\alpha_i = \sum_{j=1}^{i-1} \frac{p_j}{q_j}\,\alpha_j \tag{3.1}$$

for some integers $p_1, q_1, p_2, q_2, \ldots, p_{i-1}, q_{i-1}$. We claim that the sequence

$$A' = \left(\frac{\alpha_1}{q_1}, \frac{\alpha_2}{q_2}, \ldots, \frac{\alpha_{i-1}}{q_{i-1}}, \alpha_{i+1}, \alpha_{i+2}, \ldots, \alpha_n\right)$$


is a tower for $\gamma$. Since $A'$ is shorter than A, this contradicts the minimality of n
and proves the lemma.


To prove the claim, note first that by the division lemma, the sequence


$$\left(\frac{\alpha_1}{q_1}, \frac{\alpha_2}{q_2}, \ldots, \frac{\alpha_{i-1}}{q_{i-1}}\right)$$

is a tower. Next, note that (3.1) implies that $\alpha_i \in A'_{i-1}$ and also, by exponentiating,
that $e^{\alpha_i}$ is a polynomial (in fact a monomial) in the numbers $e^{\alpha_1/q_1}, \ldots, e^{\alpha_{i-1}/q_{i-1}}$,
so that $e^{\alpha_i} \in A'_{i-1}$. By the division lemma, $A_{i-1} \subseteq A'_{i-1}$, so
$A'_{i-1} \supseteq A_{i-1}(\alpha_i, e^{\alpha_i}) = A_i$. This ensures that the tower condition for $A'$ is satisfied
at the boundary between $\alpha_{i-1}/q_{i-1}$ and $\alpha_{i+1}$, and also that $A'_{n-1} \supseteq A_n \ni \gamma$, proving the claim.
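
To see the reduction step in action, the following sketch applies it numerically to the example tower (log 2, (log 2)/3, log(1 + e^{(log 2)/3})) from earlier in this section. Here i = 2, p_1 = 1, and q_1 = 3, so the shorter sequence is (alpha_1/3, alpha_3). Floating-point arithmetic can only illustrate the identities, not prove them, and the variable names are illustrative.

```python
import math

alpha1 = math.log(2)
alpha2 = math.log(2) / 3
alpha3 = math.log(1 + math.exp(alpha2))
gamma = 4 + alpha3

# The original tower is not reduced: alpha1 - 3*alpha2 = 0.
dependency = alpha1 - 3 * alpha2

# Reduction step: drop alpha2 and divide alpha1 by q_1 = 3.
beta1, beta2 = alpha1 / 3, alpha3

# The tower condition for beta1 now needs m_1 = 3, since exp(3*beta1) = 2 is rational.
print(dependency)                                        # approximately 0
print(math.exp(3 * beta1))                               # approximately 2
print(abs(beta2 - math.log(1 + math.exp(beta1))))        # approximately 0
print(abs(gamma - (4 + beta2)))                          # 0: gamma lies in the new field
```

The price of removing the linear dependence is visible in the exponent: the original tower used m_i = 1 throughout, while the shorter one needs m_1 = 3.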


Proof of Theorem 2. We first make a general remark. If $B = (\beta_1, \beta_2, \ldots, \beta_n)$ is a
reduced tower, then Schanuel’s conjecture implies that for all i, exactly one of $\beta_i$
and $e^{\beta_i}$ is algebraic over $B_{i-1}$. For by the definition of a tower, at least one of the
two is algebraic over $B_{i-1}$; this implies that the transcendence degree of $B_i$ over Q
is at most i for all i. Then because B is reduced, Schanuel’s conjecture applies, so
$\beta_i$ and $e^{\beta_i}$ cannot both be algebraic over $B_{i-1}$, and the transcendence degree of $B_i$
over Q must be exactly i.


Now assume Schanuel’s conjecture. We first prove Conjecture 1. Assume that
$R \in E$; we derive a contradiction. By the reduction lemma, there is a reduced
tower $A = (\alpha_1, \alpha_2, \ldots, \alpha_n)$ for R. Since e is transcendental, $R \notin Q$ (for $R = p/q$
implies $e^p = (-p/q)^q$), so $n \ge 1$. By truncating the tower if necessary, we may
assume that $R \notin A_i$ if $i < n$.

Let $A' = (\alpha_1, \alpha_2, \ldots, \alpha_n, R)$. Then $R \in A'_n$ and the relation $R + e^R = 0$ shows
that $e^R \in A'_n$ as well. By our “general remark,” $A'$ cannot be reduced. But A is
reduced, so

$$R = \sum_{i=1}^{n} \frac{p_i}{q_i}\,\alpha_i$$

for some integers $p_1, q_1, p_2, q_2, \ldots, p_n, q_n$. Moreover, $p_n \ne 0$ because $R \notin A_i$ for
$i < n$. The relation $R + e^R = 0$ becomes


$$\sum_{i=1}^{n} \frac{p_i}{q_i}\,\alpha_i + \prod_{i=1}^{n} \left(e^{\alpha_i/q_i}\right)^{p_i} = 0. \tag{3.2}$$


Let $A'' = (\alpha_1/q_1, \alpha_2/q_2, \ldots, \alpha_n/q_n)$. By the division lemma, $A''$ is a tower, and
since A is reduced, $A''$ is reduced. But since $p_n \ne 0$, (3.2) shows that if $\alpha_n/q_n$ is
algebraic over $A''_{n-1}$ then so is $e^{\alpha_n/q_n}$, and vice versa. By our “general remark,” $A''$
cannot be reduced, and this gives our desired contradiction.


Now for Conjecture 2. We assume that the reader is familiar with the rudiments 
of Galois theory. Assume that $r_1 \in E$; we derive a contradiction. By the reduction
lemma there is a reduced tower $A = (\alpha_1, \alpha_2, \ldots, \alpha_n)$ for $r_1$ (of course, unrelated
to the “A” in the first part of this proof). For all i, let


$$\beta_i = \begin{cases} \alpha_i, & \text{if } \alpha_i \text{ is transcendental over } A_{i-1}; \\ e^{\alpha_i}, & \text{if } e^{\alpha_i} \text{ is transcendental over } A_{i-1}. \end{cases}$$


Then the $\beta_i$ are algebraically independent and form a transcendence basis for $A_n$
over Q. Let $F = Q(\beta_1, \beta_2, \ldots, \beta_n)$. Clearly $A_n$ is an extension by radicals of F.
Let L be the Galois closure of $A_n$ over F; then Gal(L/F) is solvable. If $F'$ is the
splitting field over Q of the polynomial (1.2), then $\mathrm{Gal}(F'/Q) \cong S_5$ and $F' \cap F = Q$
because $F'/Q$ is an algebraic extension while $F/Q$ is a purely transcendental
extension. Therefore the compositum $FF'$ is Galois over F with Galois group $S_5$
[11, Chapter 8, Theorem 1.12]. But $FF' \subseteq L$, so $S_5$ must be a homomorphic image
of Gal(L/F). This is our desired contradiction, because every homomorphic
image of a solvable group is solvable. ∎


4. RELATED WORK AND OPEN PROBLEMS. Much of the work that has been 
done on fields such as E, L, or the field of elementary numbers has been motivated 
by problems in logic and computer algebra. A typical problem is this: given a 
complicated expression for a number in E, how can you recognize if it equals zero? 
Clearly this is an important problem for designers of symbolic computation 
software. It is harder than it might seem at first glance, and is still not fully solved, 
although Richardson [17] has described an explicit procedure that takes a given 
elementary number and, if the procedure terminates, correctly says whether or not 
the number equals zero. He has also proved that if Schanuel’s conjecture is true, 
then the procedure always terminates. This more or less solves the zero-recogni- 
tion problem for elementary numbers (and a fortiori for E and L) in practice. 

The zero-recognition problem is closely related to a famous long-standing 
question of Tarski. Tarski proved that the first-order theory of the real numbers is 
decidable, which implies in particular that there is an algorithm for determining 
whether or not any given finite system of polynomial equations and inequalities has 
a solution in the reals [8, p. 340]. The proof proceeds by quantifier elimination, 
which we can think of roughly as follows: the statement that “there exists a 
solution” involves existential quantifiers, and quantifier elimination is a procedure 
for transforming such statements into ones that are quantifier-free. These are then 
easy to check because all that is involved is a zero-recognition problem for 
integers. After proving his theorem, Tarski asked if it could be extended to the 
first-order theory of the real numbers with exponentiation. 

This problem is very hard, because it turns out that quantifier elimination is not 
possible in this theory. Moreover, checking quantifier-free statements involves the 
zero-recognition problem for expressions with exponentials, which is difficult. 
Great progress has been made recently, however. Macintyre [13] showed that if 
Schanuel’s conjecture is true, then there is a decision procedure for the quantifier-free
statements. Then Wilkie proved in 1991 that the first-order theory of reals
with exponentiation is model complete [19], which roughly means that quantifiers 
can “almost” be entirely eliminated. Building on this work, Macintyre and Wilkie 
showed that if Schanuel’s conjecture is true, then the first-order theory of the real 
numbers with exponentiation is decidable [14]. In particular, from these methods 
one can extract a zero-recognition procedure for elementary numbers (again, 
contingent on Schanuel). See [15] for a splendid account of these and related 
results. 

Zero-recognition in E should be easier than zero-recognition in the elementary 
numbers. Can one recognize zero in E without assuming Schanuel, or at least 
by assuming something weaker? The ideas of Macintyre [13] are a good starting- 
point here. 

Another interesting open problem, posed by Thomas Colthurst (in a sci.math 
article posted on June 21, 1993), is to produce an explicit example of a number 
that is not in E. Since E is countable, Cantor’s diagonal argument gives us an 
algorithm for producing the decimal expansion of a non-EL number, but this is not 
very satisfying. It would be much nicer if we could prove, for example, that
$\zeta(3) \notin E$, but this seems difficult. Colthurst suggests that one might expect an
expression of the form


$$F = \sum_{m=1}^{\infty} f(m)$$


to work, where f(m) is a nonnegative function of m that approaches zero rapidly.
For example, the sets $E_n$ (defined in Section 2) are finite, so there exists some
$\epsilon_n > 0$ such that any two distinct numbers in $E_n$ differ in absolute value by at least
$\epsilon_n$. If one could find f with the property that, for all n,


$$\sum_{m=1}^{n} f(m) \in E_n \qquad \text{and} \qquad \sum_{m=n+1}^{\infty} f(m) < \epsilon_n,$$


then F could not be in $E_n$ for any n. This is probably too naive, but perhaps
something along these lines is feasible. Can f be chosen to be elementary? 

Steve Finch asks for the relationship between EL numbers and holonomic 
constants. A holonomic function is a solution of a linear homogeneous ODE with 
polynomial coefficients. A holonomic constant is a value of a holonomic function at 
a rational regular point. Singular holonomic constants are values of holonomic 
functions in the vicinity of a singular point. Several famous constants such as $\pi$,
$\zeta(3)$ and Catalan’s constant are singular holonomic constants.

On the grounds that many high-school students are unfamiliar with complex
numbers, one can ask for a “real analogue” of E. What is the right definition?
Such a real analogue would lack many of the nice properties of E (e.g., recall that if 
an irreducible cubic with rational coefficients has three distinct real roots, then 
they cannot be expressed using radicals alone if complex numbers are forbidden), 
but it might still be interesting. 

Finally, we mention that Richardson (personal communication) has shown that 
if Schanuel’s conjecture is false, then there is a counterexample involving only 
elementary numbers. Can this be strengthened to show that any counterexample 
must lie in E? 

We hope the reader is tempted to attack these relatively untouched questions.


ACKNOWLEDGMENTS. The notion of an EL number occurred to me a long time ago and I have 
benefited from discussions with numerous people over the years. The ideas of Daniel Richardson and 
Thomas Colthurst have been particularly helpful and have had a profound influence on this paper. I 
would like to thank Alexander Barvinok, Alan Head, Jerry Shurman, Arpad Toth, and Robert Corless 
for helpful comments on the mathematics related to references [1], [5], [6], and [10]. Thanks also to 
Juergen Weiss, Alexander Pruss, and David Feldman for brief email messages that they may have 
forgotten by now but that helped point me in the right direction when I was getting started. Finally, I 
wrote most of this paper while I was an assistant professor in the University of Michigan mathematics 
department, where I was supported in part by a National Science Foundation Postdoctoral Fellowship. 


REFERENCES 


1. V.B. Alekseev, Abel’s Theorem in Problems and Solutions, Izdat. “Nauka,” 1976 (Russian). 

2. A. Baker, Transcendental Number Theory, Cambridge Mathematical Library, Camb. Univ. Press, 
1990. 

3. G. Belardinelli, Fonctions hypergéométriques de plusieurs variables et résolution analytique des 
équations algébriques générales, Mémorial des Sci. Math. 145 (1960). 

4. M. Bronstein, Symbolic Integration I: Transcendental Functions, Algorithms and Computation in 
Mathematics, Volume 1, Springer-Verlag, 1997. 

5. R.M. Corless, Is elementary? Math 498/990 notes, Nov. 23, 1995,
http://www.apmaths.uwo.ca/~rcorless/AM563/NOTES/Nov_23_95/Nov_23_95.html

6. R.M. Corless, G. H. Gonnet, D. E. G. Hare, D. J. Jeffrey, and D. E. Knuth, On the Lambert W 
function, Adv. Comput. Math. 5 (1996) 329-359. 

7. I. N. Herstein, Topics in Algebra, 2nd ed., Wiley, 1975. 

8. N. Jacobson, Basic Algebra I, 2nd ed., W. H. Freeman, 1985. 

9. C. Jordan, Traité des Substitutions et des Equations Algébriques, Gauthier-Villars, 1870. 

10. R. Bruce King, Beyond the Quartic Equation, Birkhäuser Boston, 1996.

11. S. Lang, Algebra, 2nd ed., Addison-Wesley, 1984. 

12. F.-C. Lin, Schanuel’s conjecture implies Ritt’s conjecture, Chinese J. Math. 11 (1983) 41-50. 

13. A. Macintyre, Schanuel’s conjecture and free exponential rings, Ann. Pure Appl. Logic 51 (1991) 
241-246. 

14. A. Macintyre and A. J. Wilkie, On the decidability of the real exponential field, in Kreiseliana: 
About and Around Georg Kreisel, ed. P. Odifreddi, A. K. Peters, 1996. 

15. D. Marker, Model theory and exponentiation, Notices Amer. Math. Soc. 43 (1996) 753-759. 

16. D. Richardson, The elementary constant problem, in Proceedings of the International Symposium 
on Symbolic and Algebraic Computation, Berkeley, July 27-29, 1992, ed. P. S. Wang, ACM Press, 
1992. 

17. D. Richardson, How to recognize zero, J. Symb. Comp. 24 (1997) 627-645. 

18. J. Ritt, Integration in Finite Terms: Liouville’s Theory of Elementary Methods, Columbia Univ. Press,
1948.

19. A. J. Wilkie, Model completeness results for expansions of the ordered field of real numbers by 
restricted Pfaffian functions and the exponential function, J. Amer. Math. Soc. 9 (1996) 1051-1094. 


TIMOTHY CHOW received his Ph.D. from M.I.T. in 1995 under Richard Stanley and then spent three 
years as an assistant professor and NSF postdoc at the University of Michigan. Then, tired of
publish-or-perish, he turned down a tenure-track offer from a Group I school in favor of a research 
position in industry. His main job now is to design the next generation of telecommunications 
equipment, but he continues to do mathematics and to participate in the M.I.T. combinatorics group. 
Lately, he has been filling much of his spare time with poetry, history, jogging, and Bible trivia. 
Tellabs Research Center, One Kendall Square, Cambridge, MA 02139, U.S.A. 

tchow@alum.mit.edu


NOTES 


Edited by Jimmie D. Lawson and William Adkins 


The Smallest Solution of
$\phi(30n + 1) < \phi(30n)$ Is...


Greg Martin 


In a previous issue of this MONTHLY, D. J. Newman [1] showed that for any positive
integers a, b, c, and d with $ad \ne bc$, there exist infinitely many positive integers n
for which $\phi(an + b) < \phi(cn + d)$, where $\phi(m)$ is the familiar Euler totient
function, the number of positive integers less than and relatively prime to m. In
particular, it must be the case that $\phi(30n + 1) < \phi(30n)$ infinitely often; however,
Newman mentions that there are no solutions of this inequality with $n <
20{,}000{,}000$, and he states that a solution “is not explicitly available and it may be
beyond the reach of any possible computers.” The purpose of this note is to
describe a method for computing solutions to inequalities of this type that avoids
the need to factor large numbers. In particular, we explicitly compute the smallest
number n satisfying $\phi(30n + 1) < \phi(30n)$.

It is quite easy to compute values of n for which $\phi(30n + 1)$ is relatively small
by imposing many congruence conditions on n modulo primes, so that 30n + 1 is
highly composite. However, the numbers n that arise in this way are quite large,
having hundreds of digits. Computing $\phi(30n)$ exactly relies on the factorization of
30n, which for integers of this size is not possible to find in a reasonable amount of
time with today’s computers and factoring algorithms. The idea underlying our
method is to use partial knowledge of the factorization of a large number m to get
an estimate for $\phi(m)$.


Claim 1. Let $p_i$ denote the i-th prime number. Let $q = \prod_{i=r+1}^{r+s} p_i$ for some positive
integers r and s, and let m be an integer that is not divisible by any of the primes
$p_1, \ldots, p_r$. Then:

(a) if $m < q$, then m has at most s distinct prime factors;

(b) if m has at most s distinct prime factors, then $\phi(m)/m \ge \phi(q)/q$.


Proof: Let t be the number of distinct prime factors of m, and let the prime
factors be $p_{i_1}, \ldots, p_{i_t}$ with $i_1 < \cdots < i_t$. Since none of the primes $p_1, \ldots, p_r$ divide
m, it must be the case that $i_1 \ge r + 1$, $i_2 \ge r + 2$, and so on. If we define
$k = \prod_{j=1}^{t} p_{r+j}$, we see that $k \le \prod_{j=1}^{t} p_{i_j} \le m$. But $m < q$ by assumption; and so
$k < q$, which can be the case only if $t < s$. This proves part (a) of the claim.


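For part (b), we use the fact that the function $\phi(m)/m$ can be written as a
product over primes dividing m:


$$\frac{\phi(m)}{m} = \prod_{p \mid m} \left(1 - \frac{1}{p}\right).$$


With k defined as above, notice that


$$\frac{\phi(m)}{m} = \prod_{j=1}^{t} \left(1 - \frac{1}{p_{i_j}}\right) \ge \prod_{j=1}^{t} \left(1 - \frac{1}{p_{r+j}}\right) = \frac{\phi(k)}{k},$$


since $1 - 1/p$ is an increasing function of p. On the other hand, since $t \le s$ by
assumption, we have


$$\frac{\phi(k)}{k} = \prod_{j=r+1}^{r+t} \left(1 - \frac{1}{p_j}\right) \ge \prod_{j=r+1}^{r+s} \left(1 - \frac{1}{p_j}\right) = \frac{\phi(q)}{q},$$


since each $1 - 1/p$ is less than 1. This proves part (b) of the claim.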
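
Claim 1 can be spot-checked by brute force for small parameters. The sketch below tests both parts for every m up to a bound; the function names and the particular choices of r, s, and bound are illustrative, and passing the check is evidence for small cases only, not a proof.

```python
from fractions import Fraction
from math import prod

def primes_up_to(limit: int) -> list:
    """Sieve of Eratosthenes returning all primes <= limit."""
    sieve = bytearray([1]) * (limit + 1)
    sieve[0] = sieve[1] = 0
    for p in range(2, int(limit ** 0.5) + 1):
        if sieve[p]:
            sieve[p * p :: p] = bytearray(len(sieve[p * p :: p]))
    return [k for k, flag in enumerate(sieve) if flag]

def phi_ratio(prime_factors) -> Fraction:
    """phi(m)/m as an exact fraction, given the distinct prime factors of m."""
    return prod((Fraction(p - 1, p) for p in prime_factors), start=Fraction(1))

def check_claim_1(r: int, s: int, bound: int) -> bool:
    primes = primes_up_to(bound)
    excluded, block = primes[:r], primes[r:r + s]
    q = prod(block)              # q = p_{r+1} * ... * p_{r+s}
    ratio_q = phi_ratio(block)
    for m in range(1, bound + 1):
        if any(m % p == 0 for p in excluded):
            continue             # the claim only concerns m coprime to p_1..p_r
        factors = [p for p in primes if m % p == 0]
        if m < q and len(factors) > s:
            return False         # would contradict part (a)
        if len(factors) <= s and phi_ratio(factors) < ratio_q:
            return False         # would contradict part (b)
    return True

print(check_claim_1(r=3, s=2, bound=2000))
```

Exact fractions are used so that the comparison in part (b) is not affected by rounding.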


We now proceed to find the smallest solution of $\phi(30n + 1) < \phi(30n)$; our
method applies to any inequality of the form $\phi(an + b) < \phi(cn + d)$. Clearly
$30n + 1 \equiv 1 \pmod{30}$ for all n. Also, if n is a solution of $\phi(30n + 1) < \phi(30n)$,
then


$$\frac{\phi(30n + 1)}{30n + 1} < \frac{\phi(30n)}{30n + 1} < \frac{\phi(30)\,n}{30n} = \frac{4}{15},$$


since the inequality $\phi(ab) \le \phi(a)\,b$ holds for all a and b. Thus it makes sense to
look for numbers that satisfy both these conditions.


Claim 2. Let $z = (p_4 p_5 \cdots p_{383})\, p_{385}\, p_{388}$. Then z is the smallest positive integer
satisfying $z \equiv 1 \pmod{30}$ and $\phi(z)/z < 4/15$.


Proof: A computation shows that z is indeed congruent to 1 (mod 30) and that


$$\frac{\phi(z)}{z} = \left(\prod_{i=4}^{383} \left(1 - \frac{1}{p_i}\right)\right) \left(1 - \frac{1}{p_{385}}\right) \left(1 - \frac{1}{p_{388}}\right) = 0.26661\ldots < \frac{4}{15}.$$


Suppose m is an integer satisfying $m \equiv 1 \pmod{30}$ and $\phi(m)/m < 4/15$. Because
of the congruence condition, m cannot be divisible by 2, 3, or 5. If we define
$q_1 = \prod_{i=4}^{384} p_i$, then $\phi(q_1)/q_1 = 0.26671\ldots$, and so $\phi(q_1)/q_1 > \phi(m)/m$. Thus if
we apply part (b) of Claim 1 with r = 3 and s = 381, we conclude that m must
have more than 381 distinct prime factors.

Another computation reveals that the only numbers less than z that have at
least 382 distinct prime factors are the numbers $p_4 p_5 \cdots p_{382}\, m'$, where


$$m' \in \{\, p_{383} p_{384} p_{385},\ p_{383} p_{384} p_{386},\ p_{383} p_{385} p_{386},\ p_{383} p_{384} p_{387},\ p_{383} p_{385} p_{387},\ p_{384} p_{385} p_{386},\ p_{383} p_{384} p_{388},\ p_{383} p_{386} p_{387} \,\};$$


and none of these numbers is congruent to 1 (mod 30).

Let us define $n = (z - 1)/30$, which by Claim 2 is both an integer and the
smallest possible solution of $\phi(30n + 1) < \phi(30n)$. (Small wonder that we haven’t
stumbled across any solutions of this inequality: n has 1,116 digits!) It would be
quite gracious of n to be an actual solution, and indeed it is.
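
The number z, and hence n, can be recomputed directly with exact integer arithmetic. The sketch below assumes the reading of Claim 2 in which z is the product of p_4 through p_383 together with p_385 and p_388 (the indices are reconstructed from a damaged scan), and it prints the three properties the text asserts rather than presupposing them. Helper and variable names are illustrative.

```python
from math import prod

def primes_up_to(limit: int) -> list:
    """Sieve of Eratosthenes returning all primes <= limit."""
    sieve = bytearray([1]) * (limit + 1)
    sieve[0] = sieve[1] = 0
    for p in range(2, int(limit ** 0.5) + 1):
        if sieve[p]:
            sieve[p * p :: p] = bytearray(len(sieve[p * p :: p]))
    return [k for k, flag in enumerate(sieve) if flag]

# 1-based indexing so that p[1] = 2, matching the notation p_i in the text.
# The bound 3000 is an assumption: it is comfortably large enough to reach p_388.
p = [None] + primes_up_to(3000)

# Assumed reading of Claim 2: z = (p_4 p_5 ... p_383) * p_385 * p_388.
z_factors = [p[i] for i in range(4, 384)] + [p[385], p[388]]
z = prod(z_factors)
phi_z = prod(q - 1 for q in z_factors)  # valid because z is squarefree

n = (z - 1) // 30

print(z % 30)               # the text asserts z is congruent to 1 mod 30
print(15 * phi_z < 4 * z)   # the text asserts phi(z)/z < 4/15
print(len(str(n)))          # the text asserts n has 1,116 digits
```

Comparing `15 * phi_z < 4 * z` keeps the test of phi(z)/z < 4/15 in exact integers, avoiding any floating-point error near the threshold.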

First we show that $\phi(30n + 1)/(30n + 1) < \phi(30n)/30n$. We have already
computed


$$\frac{\phi(30n + 1)}{30n + 1} = \frac{\phi(z)}{z} = 0.26661\ldots. \tag{1}$$


It turns out that n is divisible by both 60 and $p_{4874} = 47{,}279$, so define
$n' = n/(60\, p_{4874})$. We can compute that $n'$ is not divisible by any of the first 80,000
primes. This computation can be done quickly by multiplying the primes together
in blocks of 1,000, say, and computing the greatest common divisor of $n'$ and the
product. Since computing greatest common divisors is a very fast operation,
checking that $n'$ is not divisible by any of the first 80,000 primes takes only a few
minutes on a workstation, which is much more reasonable than trying to factor a number
with over a thousand digits.
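
The block-gcd technique described above can be sketched as follows. The function `coprime_to_primes` and its `block_size` parameter are hypothetical names, and the demonstration uses small numbers rather than the 1,116-digit n of this note.

```python
import math

def primes_up_to(limit: int) -> list:
    """Sieve of Eratosthenes returning all primes <= limit."""
    sieve = bytearray([1]) * (limit + 1)
    sieve[0] = sieve[1] = 0
    for p in range(2, int(limit ** 0.5) + 1):
        if sieve[p]:
            sieve[p * p :: p] = bytearray(len(sieve[p * p :: p]))
    return [k for k, flag in enumerate(sieve) if flag]

def coprime_to_primes(n: int, primes: list, block_size: int = 1000) -> bool:
    """Return True if no prime in the list divides n, using one gcd per block."""
    for start in range(0, len(primes), block_size):
        block_product = math.prod(primes[start:start + block_size])
        if math.gcd(n, block_product) != 1:
            return False
    return True

small_primes = primes_up_to(10_000)
candidate = 10_007 * 10_009  # both factors are primes larger than 10,000

print(coprime_to_primes(candidate, small_primes))          # True
print(coprime_to_primes(candidate * 9_973, small_primes))  # False: 9,973 is in the list
```

Each gcd replaces up to `block_size` trial divisions, which is the saving the text relies on when n has more than a thousand digits.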

Now define $q_2 = \prod_{i=80{,}001}^{80{,}186} p_i$. We compute that $q_2$ has 1,118 digits and so
$q_2 > n > n'$. By using parts (a) and (b) of Claim 1 with r = 80,000 and s = 186, we
see that $\phi(n')/n' \ge \phi(q_2)/q_2$. Therefore, since $\phi(ab) = \phi(a)\phi(b)$ when a and b
are relatively prime, we compute


$$\frac{\phi(30n)}{30n} = \frac{\phi(30 \cdot 60\, p_{4874})}{30 \cdot 60\, p_{4874}} \cdot \frac{\phi(n')}{n'} \ge \frac{4}{15} \left(1 - \frac{1}{47{,}279}\right) \frac{\phi(q_2)}{q_2} = 0.2666124\ldots. \tag{2}$$


This shows that $\phi(30n + 1)/(30n + 1) < \phi(30n)/30n$, which doesn’t quite
imply that $\phi(30n + 1) < \phi(30n)$; it implies only that $\phi(30n + 1) < \phi(30n)(1 + 1/(30n))$.
However, the numbers computed in (1) and (2) differ in the sixth decimal place,
while multiplying by $1 + 1/(30n)$ leaves a number unchanged until past the
1,100th decimal place.

Therefore the following theorem has been established. 


Theorem. The smallest solution of $\phi(30n + 1) < \phi(30n)$ is

n = 232, 909, 810, 175, 496, 793, 814, 049, 684, 205, 233, 780, 004, 859, 885, 966, 051, 235, 363, 345, 311, 075, 
888, 344, 528, 723, 154, 527, 984, 260, 176, 895, 854, 182, 634, 802, 907, 109, 271, 610, 432, 287, 652, 976, 
907, 467, 574, 362, 400, 134, 090, 318, 355, 962, 121, 476, 785, 712, 891, 544, 538, 210, 966, 704, 036, 990, 
885, 292, 446, 155, 135, 679, 717, 565, 808, 063, 766, 383, 846, 220, 120, 606, 143, 826, 509, 433, 540, 250, 
085, 111, 624, 970, 464, 541, 380, 934, 486, 375, 688, 208, 918, 750, 640, 674, 629, 942, 465, 499, 369, 036, 
578, 640, 331, 759, 035, 979, 369, 302, 685, 371, 156, 272, 245, 466, 396, 227, 865, 621, 951, 101, 808, 240, 
692, 259, 960, 203, 091, 330, 589, 296, 656, 888, 011, 791, 011, 416, 062, 631, 565, 320, 593, 772, 287, 118, 
913, 728, 608, 997, 901, 791, 216, 356, 108, 665, 476, 306, 080, 740, 121, 528, 236, 888, 680, 120, 152, 479, 
138, 327, 451, 088, 404, 280, 929, 048, 314, 912, 122, 784, 879, 758, 304, 016, 832, 436, 751, 532, 255, 185, 
640, 249, 324, 065, 492, 491, 511, 072, 521, 585, 980, 547, 438, 748, 689, 307, 159, 363, 481, 233, 965, 802, 
331, 725, 033, 663, 862, 618, 957, 168, 974, 043, 547, 448, 879, 663, 217, 971, 081, 445, 619, 618, 789, 985, 
472, 074, 303, 100, 303, 636, 078, 827, 273, 695, 551, 162, 089, 725, 435, 110, 246, 701, 964, 021, 045, 849, 
081, 811, 604, 427, 331, 227, 553, 783, 590, 821, 510, 091, 607, 567, 178, 842, 569, 576, 699, 548, 038, 217, 
673, 171, 895, 383, 249, 326, 800, 667, 432, 993, 531, 186, 437, 659, 910, 632, 865, 419, 892, 370, 957, 722, 
154, 266, 351, 039, 808, 548, 150, 828, 868, 968, 820, 675, 198, 820, 381, 135, 523, 646, 361, 202, 383, 915, 
218, 571, 017, 801, 463, 011, 491, 108, 784, 343, 253, 284, 393, 511, 650, 254, 506, 597, 923, 969, 653, 616, 
813, 897, 710, 621, 756, 693, 827, 471, 154, 701, 151, 222, 320, 443, 347, 408, 180, 047, 964, 860. 
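
The size of this $n$ means that no direct search could ever find it. As a small sanity check, the following sketch (an illustration only, not the computation used in this note) evaluates Euler's function by trial division and confirms that no small $n$ satisfies the inequality. The function names and the search limit are assumptions made for this example.

```python
def totient(m: int) -> int:
    """Euler's phi function, computed by trial-division factorization."""
    result = m
    p = 2
    while p * p <= m:
        if m % p == 0:
            while m % p == 0:
                m //= p
            result -= result // p      # multiply result by (1 - 1/p)
        p += 1
    if m > 1:                          # one prime factor above sqrt(m) may remain
        result -= result // m
    return result


def first_small_solution(limit: int):
    """Return the first n <= limit with phi(30n + 1) < phi(30n), or None."""
    for n in range(1, limit + 1):
        if totient(30 * n + 1) < totient(30 * n):
            return n
    return None


if __name__ == "__main__":
    # Hypothetical search bound; any feasible bound is far below the n above.
    print(first_small_solution(2000))
```

Since $\phi(30n)/30n \le 4/15$ for every $n$, a solution requires $30n + 1$ to have a very large number of distinct prime factors, which is why the search above comes back empty.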


ACKNOWLEDGMENTS. I thank Mike Bennett for verifying my computations and acknowledge the 
support of National Science Foundation grant DMS 9304580. 


REFERENCES 


1. D. J. Newman, Euler’s $\phi$ function on arithmetic progressions, Amer. Math. Monthly 104 (1997)
256-257.


University of Toronto, Toronto M5S 3G3, Canada 
gerg@math.toronto.edu 




A Matrix Representation for Euler’s 
Constant, γ


Frank K. Kenter 


Euler’s constant, $\gamma = 0.5772156649\ldots$, can be represented as the product of an
infinite row vector, the inverse of a $\mathbb{Z}^+ \times \mathbb{Z}^+$ lower triangular matrix, and an
infinite $\mathbb{Z}^+ \times 1$ column vector, all with entries that are either zero or simple unit
fractions.

Observe that for $\mathbb{Z}^+ \times \mathbb{Z}^+$ lower triangular matrices, the end result of all
arithmetic matrix operations, matrix inversion, application to infinite column 
vectors, etc., has appropriate n-th truncation equal to that obtained by first 
truncating all matrices, and then carrying out the operations. Hence these opera- 
tions may be performed and the familiar identities of linear algebra continue to 
hold in this context. 


Theorem. Let $u$ be the row vector $\{u_k = 1/k : k \in \mathbb{Z}^+\}$, where $\mathbb{Z}^+$ denotes the
positive integers, and let $M$ be the matrix with entries $\{m_{ij} = 1/(i - j + 1)$ if $j \le i$,
$m_{ij} = 0$ if $j > i : i, j \in \mathbb{Z}^+\}$. Let $v$ be the column vector $\{v_n = 1/(n + 1) : n \in \mathbb{Z}^+\}$.
Then the product $u(M^{-1}v)$ exists (as a convergent series), and is equal to Euler’s
constant,

$$\gamma = \lim_{m \to \infty} \left( \sum_{n=1}^{m} \frac{1}{n} - \ln m \right) \approx 0.5772156649\ldots .$$


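Explicitly,

$$\gamma = \begin{pmatrix} 1 & \tfrac12 & \tfrac13 & \tfrac14 & \cdots \end{pmatrix}
\begin{pmatrix}
1 & 0 & 0 & 0 & \cdots \\
\tfrac12 & 1 & 0 & 0 & \cdots \\
\tfrac13 & \tfrac12 & 1 & 0 & \cdots \\
\tfrac14 & \tfrac13 & \tfrac12 & 1 & \cdots \\
\vdots & \vdots & \vdots & \vdots & \ddots
\end{pmatrix}^{-1}
\begin{pmatrix} \tfrac12 \\ \tfrac13 \\ \tfrac14 \\ \tfrac15 \\ \vdots \end{pmatrix}.$$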


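Proof: Substituting $t = 1 - e^{-x}$ in the standard definite integral

$$\gamma = \int_0^\infty \left( \frac{1}{1 - e^{-x}} - \frac{1}{x} \right) e^{-x}\,dx \quad\text{gives}\quad \gamma = \int_0^1 \left[ 1 + \frac{t}{\ln(1 - t)} \right] \frac{dt}{t}.$$

Write

$$-\frac{t}{\ln(1 - t)} = \sum_{k=0}^{\infty} c_k t^k,$$

where the coefficients $c_k$ are obtained from the formal division of power series.
Since

$$-\frac{\ln(1 - t)}{t} = \sum_{k=1}^{\infty} \frac{t^{k-1}}{k}$$

converges for $|t| < 1$ and does not vanish at $t = 0$,
$\sum_{k=0}^{\infty} c_k t^k$ converges on some interval around 0, and $1 = \left(\sum_{k=1}^{\infty} t^{k-1}/k\right)\left(\sum_{k=0}^{\infty} c_k t^k\right)$
on this interval. Since $c_0 = \lim_{t \to 0} -t/\ln(1 - t) = 1$, this is equivalent to the
system of linear equations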


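$$\begin{aligned}
-\tfrac12 &= c_1 \\
-\tfrac13 &= \tfrac12 c_1 + c_2 \\
-\tfrac14 &= \tfrac13 c_1 + \tfrac12 c_2 + c_3 \\
&\;\;\vdots
\end{aligned}$$

This system has the matrix form:

$$-\begin{pmatrix} \tfrac12 \\ \tfrac13 \\ \tfrac14 \\ \tfrac15 \\ \vdots \end{pmatrix} =
\begin{pmatrix}
1 & 0 & 0 & 0 & \cdots \\
\tfrac12 & 1 & 0 & 0 & \cdots \\
\tfrac13 & \tfrac12 & 1 & 0 & \cdots \\
\tfrac14 & \tfrac13 & \tfrac12 & 1 & \cdots \\
\vdots & \vdots & \vdots & \vdots & \ddots
\end{pmatrix}
\begin{pmatrix} c_1 \\ c_2 \\ c_3 \\ c_4 \\ \vdots \end{pmatrix}.$$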


Thus we have $-v = Mc$, where $c$ denotes the column vector $\{c_k : k \in \mathbb{Z}^+\}$.
Eliminating the constant terms between two successive equations where $c_{k-1}$ and
$c_k$ have unit coefficients, we have

$$c_k = \frac{1}{k + 1} \sum_{j=1}^{k-1} \frac{j}{(k - j)(k - j + 1)}\, c_j .$$


Consequently, by induction, $c_1 < 0, c_2 < 0, \ldots, c_{k-1} < 0$ implies $c_k < 0$, and again
by induction $-1/(k + 1) < c_k < 0$ $(k > 1)$. The latter inequality assures the
convergence of $\sum_{k=1}^{\infty} c_k/k$. Therefore, using Abel’s Theorem, we have

$$\gamma = -\lim_{x \to 1^-} \int_0^x \sum_{k=1}^{\infty} c_k t^k\, \frac{dt}{t} = -\lim_{x \to 1^-} \sum_{k=1}^{\infty} \frac{c_k}{k} x^k = -\sum_{k=1}^{\infty} \frac{c_k}{k} = -uc.$$

Then $c = (M^{-1}M)c = M^{-1}(Mc) = -M^{-1}v$. Using $\gamma = -uc$, we obtain the result
$\gamma = u(M^{-1}v)$.
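
The truncation property noted at the start makes the representation easy to try numerically. The sketch below (an illustration under stated assumptions, not part of the proof) solves the truncated system $Mc = -v$ by forward substitution and then forms $-uc$. The function names and the truncation size are choices made for this example. Because every term $-c_k/k$ is positive, the truncated value approaches $\gamma$ from below, and rather slowly.

```python
def solve_c(n_terms: int) -> list:
    """Forward substitution for the lower triangular system M c = -v.

    Row n reads: c_n + c_{n-1}/2 + ... + c_1/n = -1/(n + 1).
    """
    c = []
    for n in range(1, n_terms + 1):
        acc = -1.0 / (n + 1)
        for j in range(1, n):
            acc -= c[j - 1] / (n - j + 1)
        c.append(acc)
    return c


def gamma_truncated(n_terms: int) -> float:
    """Truncated value of u (M^{-1} v) = -sum_k c_k / k."""
    c = solve_c(n_terms)
    return -sum(ck / k for k, ck in enumerate(c, start=1))


if __name__ == "__main__":
    print(solve_c(3))            # first three coefficients c_1, c_2, c_3
    print(gamma_truncated(400))  # a lower estimate of Euler's constant
```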




We note that $M$ is an unbounded operator on $\ell_2$. For example, using the
Euclidean norm the sequence of unit vectors defined by

$$w_m = \{w_m\}_n = \frac{1}{\sqrt{m}} \ \text{ for } n \le m, \quad \text{and } = 0 \ \text{ for } n > m$$

transforms into the sequence $Mw_m$, which diverges because

$$\|Mw_m\|^2 \ge \frac{1}{m} \sum_{p=1}^{m} \left( \sum_{k=1}^{p} \frac{1}{k} \right)^2,$$

which grows faster than $\ln(m!)/m \approx \ln m$, as $m$ increases.


2170 Monterey Avenue, Menlo Park, CA 94025 
frank.kenter@smi.siemens.com 


More on a Mean Value Theorem 
Converse 


H. Fejzic and D. Rinne 


In a recent MONTHLY article Tong and Braza considered two possible versions of a
converse to the Mean Value Theorem [2]. For $c \in (a, b)$, a continuous function $f$
on $[a, b]$ that is differentiable on $(a, b)$ satisfies the

1. Weak Form at $c$ if $f'(c) = \dfrac{f(\beta) - f(\alpha)}{\beta - \alpha}$ for some interval $(\alpha, \beta) \subset (a, b)$,
and the
2. Strong Form at $c$ if $f'(c) = \dfrac{f(\beta) - f(\alpha)}{\beta - \alpha}$ for some interval $(\alpha, \beta) \subset (a, b)$
with $c \in (\alpha, \beta)$.


In [2] the authors give a function that fails the Weak Form (and so fails both 
forms) at all values in a countable closed set. Borwein and Wang provided a 
function that fails the Weak Form on a residual set (one whose complement is of 
first category) that is of Lebesgue measure zero [1]. 

We show that a differentiable function can fail the Weak Form on a set that is
both residual and of relative measure arbitrarily close to 1 while the Strong Form
must hold on some subset of positive Lebesgue measure. In the rest of this Note
measure means Lebesgue measure, denoted by $\lambda$.

We consider $[a, b] = [0, 1]$ and let $Z$ be any measurable set in $[0, 1]$ with
$\lambda(Z) < 1$. Let $E \subset [0, 1] \setminus Z$ be an $F_\sigma$ set with $\lambda(E) = \lambda([0, 1] \setminus Z) > 0$ and $E$
having density 1 at each $x \in E$ ($\lim_{\varepsilon \to 0} \lambda(E \cap (x - \varepsilon, x + \varepsilon))(2\varepsilon)^{-1} = 1$). Let $g$
be an approximately continuous function (at each $x$ the restriction of $g$ to some
subset with density 1 at $x$ is continuous at $x$) such that:


1. $0 < g(x) < 1$ for $x \in E$, and
2. $g(x) = 0$ for $x \notin E$. (1)


A construction of such functions can be found in Zahorski [3]. Since $g$ is bounded
and approximately continuous it is the derivative of its integral $f(x) = \int_0^x g(t)\,dt$.
Therefore $f' = 0$ on $Z$. We can pick $Z$ to be dense in $[0, 1]$ and of measure
arbitrarily close to 1 with $E$ having positive measure in every subinterval of $[0, 1]$.
Then $f$ is strictly increasing and thus has no difference quotient equal to zero.
Hence $f$ fails the Weak Form at every point of $\{x \mid f'(x) = 0\}$ and thus at every
point of $Z$. Since $\{x \mid f'(x) = 0\}$ is a dense $G_\delta$ (it’s the complement of the $F_\sigma$ set $E$),
$f$ fails the Weak Form on a residual set.

However, the following theorem shows that a differentiable function cannot fail
the Weak Form almost everywhere.


Theorem 1. If $f$ is a continuous function on $[a, b]$ that is differentiable on $(a, b)$, then
$f$ satisfies the Strong Form on a subset of $[a, b]$ that has positive measure in every
subinterval.


Proof: Let $[\alpha, \beta] \subset [a, b]$. We may assume that $f$ is not linear on any subinterval
of $[\alpha, \beta]$ since it would then obviously satisfy the Strong Form there. Let

$$h(x) = \begin{cases} \dfrac{f(x) - f(\alpha)}{x - \alpha} & \text{for } \alpha < x \le \beta, \\[2mm] f'(\alpha) & \text{for } x = \alpha. \end{cases}$$

Then $h$ is continuous on $[\alpha, \beta]$ and $h([\alpha, \beta])$ is some nondegenerate interval
$[r, s]$. Since $h$ can have only countably many local extrema we can pick $u \in (\alpha, \beta)$
so that $h(u)$ is not a local extremum. Let $c$ be a point in $(\alpha, u)$ with $f'(c) = h(u)$.
Using $p = (c + u)/2$ we see that $f'(c)$ is in the interior of $h([p, \beta])$. Call this
interior $J$. Let $g$ be the restriction of $f$ to the interval $[\alpha, p]$. Then $G = (g')^{-1}(J) \ne \emptyset$
since it contains $c$ and thus $\lambda(G) > 0$ by the Denjoy-Clarkson Property (the
inverse image under a derivative of an open interval is either empty or of positive
measure). For each $x \in G$, there is a $y \in [p, \beta]$ with $f'(x) = g'(x) = h(y) =
(f(y) - f(\alpha))/(y - \alpha)$. Since $\alpha < x < y$, $f$ satisfies the Strong Form at $x$. ∎


As a final comment, we point out that a differentiable function can fail the 
Strong Form on a set of positive measure and still satisfy the Weak Form on all of 
(a,b). As an example we can simply extend our function g in (1) to the interval 
[0, 4] as follows: Let 


g(x) O<x<1 

eas =e (L(y = 2). ke 
Ce i, 

(t= 3) ox =4 


and set $F(x) = \int_0^x G(t)\,dt$. Then $F$ still fails the Strong Form on the set $Z$ above
but satisfies the Weak Form on $(0, 4)$. This is because $0 \le G = F' < 1$ on $(0, 4)$
while the difference quotients for $F$ inside the interval $(2, 4)$ assume all values in
$[0, 1)$.


REFERENCES 


1. Borwein, J. M. and Wang, Xianfu, The Converse of the Mean Value Theorem May Fail Generi- 
cally, Amer. Math. Monthly 105 (1998) 847-848. 

2. Tong, J. and Braza, P., A Converse of the Mean Value Theorem, Amer. Math. Monthly 104 (1997) 
939-942. 

3. Zahorski, Z., Sur la Premiére Dérivée, Trans. Amer. Math. Soc. 69 (1950) 1-54. 


California State University, San Bernardino, CA 92407
hfejzic@mail.csusb.edu, drinne@mail.csusb.edu 




An Elegant Continued Fraction for π


L. J. Lange 


The regular continued fraction for $\pi$ begins as follows [3, p. 23]:

$$\pi = 3 + \frac{1}{7}{\atop +}\frac{1}{15}{\atop +}\frac{1}{1}{\atop +}\frac{1}{292}{\atop +}\frac{1}{1}{\atop +}\frac{1}{1}{\atop +}\frac{1}{1}{\atop +}\frac{1}{2}{\atop +}\frac{1}{1}{\atop +}\frac{1}{3}{\atop +}\frac{1}{1}{\atop +}\frac{1}{14}{\atop +}\frac{1}{2}{\atop +}\cdots \qquad (1)$$

There is no known regularity to the partial denominators in (1) and the only known
means to obtain them is to compute them one-by-one from a known decimal
approximation for $\pi$. Lord Brouncker (1620-1686), the first president of the Royal
Society of London, gave (without proof around 1659) the first recorded infinite
continued fraction [3, p. 2]:


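$$\frac{4}{\pi} = 1 + \frac{1^2}{2}{\atop +}\frac{3^2}{2}{\atop +}\frac{5^2}{2}{\atop +}\frac{7^2}{2}{\atop +}\frac{9^2}{2}{\atop +}\frac{11^2}{2}{\atop +}\frac{13^2}{2}{\atop +}\cdots \qquad (2)$$

In 1775, according to [1, p. 131], Euler gave a proof of the validity of (2) by showing
that

$$\arctan x = \frac{x}{1}{\atop +}\frac{1^2x^2}{3 - x^2}{\atop +}\frac{3^2x^2}{5 - 3x^2}{\atop +}\frac{5^2x^2}{7 - 5x^2}{\atop +}\cdots \qquad (3)$$

is equivalent to the power series representation

$$\arctan x = \sum_{n=0}^{\infty} \frac{(-1)^n x^{2n+1}}{2n + 1}, \qquad -1 < x < 1.$$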

Brouncker’s result can be obtained by setting x = 1 in (3). 

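The following continued fraction expansion for the principal branch of the
analytic function $\arctan z$, valid for all $z$ in the complex plane not on the imaginary
axis from $i$ to $+i\infty$ and from $-i$ to $-i\infty$, is well known [3, p. 202]:

$$\arctan z = \frac{z}{1}{\atop +}\frac{1^2z^2}{3}{\atop +}\frac{2^2z^2}{5}{\atop +}\frac{3^2z^2}{7}{\atop +}\frac{4^2z^2}{9}{\atop +}\cdots \qquad (4)$$

Setting $z = 1$ in (4) leads to

$$\frac{\pi}{4} = \frac{1}{1}{\atop +}\frac{1^2}{3}{\atop +}\frac{2^2}{5}{\atop +}\frac{3^2}{7}{\atop +}\frac{4^2}{9}{\atop +}\cdots \qquad (5)$$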


Although they are not formulas for $\pi$ itself, the classical continued fractions (2)
and (5) are attractive because of the simple expressions for all of their partial
numerators and denominators. Our contribution is the following continued fraction
for $\pi$ itself, whose partial numerators and denominators are easily described
and remembered. Though the tools to derive it have long been available, to our
knowledge, this formula has not yet appeared in the literature.


Theorem 1.

$$\pi = 3 + \frac{1^2}{6}{\atop +}\frac{3^2}{6}{\atop +}\frac{5^2}{6}{\atop +}\frac{7^2}{6}{\atop +}\frac{9^2}{6}{\atop +}\frac{11^2}{6}{\atop +}\frac{13^2}{6}{\atop +}\cdots \qquad (6)$$




Proof: We think it is of interest to show in several different ways that (6) is valid.
Perron [5, p. 35] gives the following representation, which he attributes to Stieltjes:

$$x + \frac{1^2 - n^2}{2x}{\atop +}\frac{3^2 - n^2}{2x}{\atop +}\frac{5^2 - n^2}{2x}{\atop +}\cdots = \frac{4\,\Gamma\!\left(\frac{x + 3 + n}{4}\right)\Gamma\!\left(\frac{x + 3 - n}{4}\right)}{\Gamma\!\left(\frac{x + 1 + n}{4}\right)\Gamma\!\left(\frac{x + 1 - n}{4}\right)}, \qquad (7)$$

where $x > 0$ and $1 > n^2 > -\infty$. Setting $n = 0$ in (7) gives

$$x + \frac{1^2}{2x}{\atop +}\frac{3^2}{2x}{\atop +}\frac{5^2}{2x}{\atop +}\cdots = 4\left[\frac{\Gamma\!\left(\frac{x + 3}{4}\right)}{\Gamma\!\left(\frac{x + 1}{4}\right)}\right]^2, \qquad (8)$$

which is a formula also obtained by Ramanujan and Preece according to Perron
[5, p. 36]. To obtain (6) we have only to substitute $x = 3$ in (8) and employ the
properties $\Gamma(1/2) = \sqrt{\pi}$, $\Gamma(1) = 1$, and $\Gamma(x + 1) = x\Gamma(x)$ of the $\Gamma$-function. It is
surprising that apparently Ramanujan either was not aware of, or else did not
choose to record this result. To show how we really arrived at (6) the first time, we
need the following result [5, Satz 1.13, p. 28] relating to what are known as
Bauer-Muir transformations of continued fractions; see [4].


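Theorem 2. (a) If both continued fractions

$$b_0 + \frac{a_1}{b_1}{\atop +}\frac{a_2}{b_2}{\atop +}\frac{a_3}{b_3}{\atop +}\cdots$$

and

$$b_0 + r_0 + \frac{\varphi_1}{b_1 + r_1}{\atop +}\frac{a_1\varphi_2/\varphi_1}{b_2 + r_2 - r_0\varphi_2/\varphi_1}{\atop +}\frac{a_2\varphi_3/\varphi_2}{b_3 + r_3 - r_1\varphi_3/\varphi_2}{\atop +}\cdots,$$

where $\varphi_\nu = a_\nu - r_{\nu-1}(b_\nu + r_\nu)$, have positive elements and if both converge, then they
have the same value. (b) If the first continued fraction has positive elements and it
converges and if $r_\nu \ge 0$ from a certain $\nu$ on, then the second continued fraction also
converges and it has the same value as the first.

The second continued fraction in Theorem 2 is called the Bauer-Muir transform of
the first one. On page 35 of [5] is the expansion

$$z \cot\frac{\pi z}{4} = 1 + \frac{1^2 - z^2}{2}{\atop +}\frac{3^2 - z^2}{2}{\atop +}\frac{5^2 - z^2}{2}{\atop +}\frac{7^2 - z^2}{2}{\atop +}\cdots, \qquad (9)$$

which is valid for all complex $z$. If we apply Theorem 2 to this continued fraction
with $z = x \in (-1, 1)$ and

$$a_n = (2n - 1)^2 - x^2, \quad b_n = 2, \quad r_n = 2n - 1, \quad \varphi_n = 4 - x^2,$$

we obtain

$$x \cot\frac{\pi x}{4} = \frac{4 - x^2}{3}{\atop +}\frac{1^2 - x^2}{6}{\atop +}\frac{3^2 - x^2}{6}{\atop +}\frac{5^2 - x^2}{6}{\atop +}\frac{7^2 - x^2}{6}{\atop +}\cdots. \qquad (10)$$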




Taking the limit of both sides of (9) as $z \to 0$ gives Brouncker’s result (2), and
taking the limit of both sides of (10) as $x \to 0$ leads to (6) upon taking reciprocals.

It would be nice if the speed of convergence of (6) were in accordance with its
beauty, but unfortunately this is not the case. In support of this slowness assertion,
the 100th approximant of (6) rounds to 3.14159241, whereas both $\pi$ and the 4th
approximant of its regular continued fraction expansion (1) round to 3.14159265. If
the expansions (2) and (5) are used to approximate $\pi$, the 11th approximant of (5)
gives 3.14159265 as an approximation, but Brouncker’s continued fraction (2)
converges so slowly that its 1000th approximant leads to the poor estimate of
3.14259165 for $\pi$. As another source of information about $\pi$, we recommend to
the reader the recent book [2].
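
These convergence comparisons are easy to explore numerically. The sketch below (an illustration only; the function names and the convention that the $n$th approximant keeps $n$ partial numerators are assumptions of this example) evaluates truncations of (6), (2), and (5) in floating point by working from the tail of each fraction upward.

```python
import math


def eval_cf(b0, nums, dens):
    """Evaluate b0 + nums[0]/(dens[0] + nums[1]/(dens[1] + ...)), tail first."""
    tail = 0.0
    for a, b in zip(reversed(nums), reversed(dens)):
        tail = a / (b + tail)
    return b0 + tail


def pi_from_6(n):
    """n-th approximant of (6): 3 + 1^2/(6 + 3^2/(6 + ...))."""
    nums = [float((2 * k - 1) ** 2) for k in range(1, n + 1)]
    return eval_cf(3.0, nums, [6.0] * n)


def pi_from_brouncker(n):
    """Estimate of pi from the n-th approximant of Brouncker's fraction (2)."""
    nums = [float((2 * k - 1) ** 2) for k in range(1, n + 1)]
    return 4.0 / eval_cf(1.0, nums, [2.0] * n)


def pi_from_5(n):
    """Estimate of pi from (5), keeping n partial quotients 1, 3, 5, ..."""
    nums = [1.0] + [float(k * k) for k in range(1, n)]
    dens = [float(2 * k - 1) for k in range(1, n + 1)]
    return 4.0 * eval_cf(0.0, nums, dens)


if __name__ == "__main__":
    for name, value in [("(6), n=100", pi_from_6(100)),
                        ("(5), n=11", pi_from_5(11)),
                        ("(2), n=1000", pi_from_brouncker(1000))]:
        print(f"{name:12s} {value:.8f}  error {abs(math.pi - value):.2e}")
```

Evaluating from the tail avoids the overflow that the forward recurrences for numerators and denominators can produce at large $n$.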


Addendum. The formula (6) was used as a logo for the conference on continued
fractions that was held at the University of Missouri-Columbia in late May 1998.
At this conference D. Bowman of the University of Illinois mentioned in a
personal conversation that he had another approach to deriving (6). Bowman starts
with the result

$$\frac{\pi - 3}{4} = \sum_{k=1}^{\infty} \frac{(-1)^{k-1}}{2k(2k + 1)(2k + 2)} = \frac14 \sum_{k=1}^{\infty} (-1)^{k-1}\left[\frac{1}{k} + \frac{1}{k + 1} - \frac{4}{2k + 1}\right] \qquad (11)$$

and then makes use of the fact that for $a_k \ne 0$ the series $\sum_{k=1}^{\infty} (-1)^{k-1}/a_k$ and the
continued fraction

$$\frac{1}{a_1}{\atop +}\frac{a_1^2}{a_2 - a_1}{\atop +}\frac{a_2^2}{a_3 - a_2}{\atop +}\frac{a_3^2}{a_4 - a_3}{\atop +}\cdots \qquad (12)$$

are equivalent, that is, the $n$th partial sum of the series and the $n$th approximant
of the continued fraction are equal. This connection between series and continued
fractions can be derived easily from a result of Euler (see [5, p. 17] or [3, p. 37]), or
it can be proved directly by induction. After replacing $a_k$ by $2k(2k + 1)(2k + 2)$ in
(12) and calculating $a_{k+1} - a_k = 24(k + 1)^2$, we are led to the representation (6)
through a simple cancellation process that preserves the equivalence of the
continued fractions involved. Bowman mentioned that his approach to verifying (6)
gives as a welcome by-product some immediate truncation error information.
Because of the series-continued fraction equivalence and the alternating nature of
the first series in (11), we have $|\pi - f_n| < 1/((n + 1)(n + 2)(2n + 3))$, where $f_n$
is the $n$th approximant of (6).
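
The equivalence between the series in (11), the continued fraction (12), and the approximants of (6) can be checked in exact rational arithmetic. The following sketch is only a finite check for small $n$, not a proof, and its function names are assumptions of this example.

```python
import math
from fractions import Fraction


def a(k: int) -> int:
    """The denominators a_k = 2k(2k + 1)(2k + 2) used in (11)."""
    return 2 * k * (2 * k + 1) * (2 * k + 2)


def series_partial_sum(n: int) -> Fraction:
    """n-th partial sum of sum_k (-1)^(k-1) / a_k."""
    return sum((Fraction((-1) ** (k - 1), a(k)) for k in range(1, n + 1)),
               Fraction(0))


def cf12_approximant(n: int) -> Fraction:
    """n-th approximant of 1/(a_1 + a_1^2/(a_2 - a_1 + a_2^2/(a_3 - a_2 + ...)))."""
    tail = Fraction(0)
    for k in range(n, 1, -1):
        tail = Fraction(a(k - 1) ** 2) / (a(k) - a(k - 1) + tail)
    return 1 / (a(1) + tail)


def cf6_approximant(n: int) -> Fraction:
    """n-th approximant f_n of (6): 3 + 1^2/(6 + 3^2/(6 + ...))."""
    tail = Fraction(0)
    for k in range(n, 0, -1):
        tail = Fraction((2 * k - 1) ** 2) / (6 + tail)
    return 3 + tail


def truncation_bound(n: int) -> float:
    """The bound 1/((n + 1)(n + 2)(2n + 3)) on |pi - f_n|."""
    return 1.0 / ((n + 1) * (n + 2) * (2 * n + 3))


if __name__ == "__main__":
    for n in range(1, 6):
        print(n, cf6_approximant(n), abs(math.pi - float(cf6_approximant(n))),
              truncation_bound(n))
```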


REFERENCES 


1. P. Beckmann, A History of Pi, The Golem Press, Boulder, Colorado, 1977. 
2. L. Berggren, J. Borwein, and P. Borwein, Pi: A Source Book, Springer-Verlag, New York, 1997.

3. W.B. Jones and W. J. Thron, Continued Fractions: Analytic Theory and Applications, Encyclope- 
dia of Mathematics and Its Applications, Vol. 11, Addison-Wesley, Reading, MA, 1980. 

4. L. J. Lange, Continued fraction representations for functions related to the gamma function, 
Continued Fractions and Orthogonal Functions, Lecture Notes in Pure and Applied Mathematics 154 
(S. J. Cooper and W. J. Thron, eds.), Marcel Dekker, New York, 1994, pp. 233-279. 

5. O. Perron, Die Lehre von den Kettenbrüchen, Band II, Teubner, Stuttgart, 1957.


University of Missouri, Columbia, MO 65211 
jerry@math.missouri.edu 




The Reciprocity Law for Dedekind Sums 
via the Constant Ehrhart Coefficient 


Matthias Beck 


1. Introduction. The Dedekind sum can be defined for two relatively prime
positive integers $a$, $b$ by

$$\mathfrak{s}(a, b) = \frac{1}{4b} \sum_{k=1}^{b-1} \cot\frac{\pi k a}{b}\, \cot\frac{\pi k}{b}.$$


These sums appear in various branches of mathematics: number theory, algebraic
geometry, and topology; they have consequently been studied extensively in various
contexts. These include the quadratic reciprocity law [13], random number generators
[12], group actions on complex manifolds [9], and lattice point problems ([14]
or [5]). Dedekind was the first to show the following reciprocity law [3]:

$$\mathfrak{s}(a, b) + \mathfrak{s}(b, a) = -\frac14 + \frac{1}{12}\left(\frac{a}{b} + \frac{1}{ab} + \frac{b}{a}\right). \qquad (1)$$


He was led naturally to this reciprocity law by considering the $\eta$-function $\eta(\tau) =
e^{\pi i \tau/12} \prod_{n=1}^{\infty} (1 - e^{2\pi i n \tau})$ on the complex upper half plane and transforming it
under the action of the modular group $SL_2(\mathbb{Z})$.

Gauss’s law of quadratic reciprocity, for example, follows easily from (1); see [13]
or [16]. We note that $\mathfrak{s}(a, b) = \mathfrak{s}(a \bmod b, b)$. Combining this with the reciprocity
law (1), one obtains a polynomial-time algorithm for computing $\mathfrak{s}(a, b)$, similar in
spirit to the Euclidean algorithm. From this point of view, it is not surprising
(though not obvious) that $\mathfrak{s}(a, b)$ can be expressed efficiently in terms of the
continued fraction expansion of $a/b$; see [8] or [19].
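
The Euclidean-style algorithm mentioned here can be sketched in a few lines. The code below is an illustration under stated assumptions, not an implementation taken from the references: it alternates the reduction $\mathfrak{s}(a, b) = \mathfrak{s}(a \bmod b, b)$ with the reciprocity law (1) in exact rational arithmetic, and compares the result with a direct floating-point evaluation of the cotangent definition. The function names are choices made for this example.

```python
import math
from fractions import Fraction


def dedekind_cot(a: int, b: int) -> float:
    """Direct evaluation of (1/4b) * sum_{k=1}^{b-1} cot(pi k a / b) cot(pi k / b)."""
    total = 0.0
    for k in range(1, b):
        x = math.pi * k * a / b
        y = math.pi * k / b
        total += (math.cos(x) / math.sin(x)) * (math.cos(y) / math.sin(y))
    return total / (4 * b)


def dedekind_exact(a: int, b: int) -> Fraction:
    """Exact Dedekind sum via reduction mod b and the reciprocity law (1)."""
    if math.gcd(a, b) != 1:
        raise ValueError("a and b must be relatively prime")
    a %= b
    if b == 1:                  # empty sum
        return Fraction(0)
    # Reciprocity: s(a, b) = -1/4 + (a/b + 1/(ab) + b/a)/12 - s(b, a).
    rhs = Fraction(-1, 4) + (Fraction(a, b) + Fraction(1, a * b) + Fraction(b, a)) / 12
    return rhs - dedekind_exact(b, a)


if __name__ == "__main__":
    for a, b in [(1, 3), (2, 5), (3, 7), (11, 101)]:
        print(a, b, dedekind_exact(a, b), dedekind_cot(a, b))
```

Each recursive call replaces the pair $(a, b)$ by $(b \bmod a, a)$, exactly as in the Euclidean algorithm, so the number of steps is logarithmic in $b$, whereas the direct sum needs $b - 1$ terms.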

Rademacher was one of the pioneers in the use of Dedekind sums [17]; in fact, 
he found several proofs of (1) [16]. We present yet another proof, which establishes 
a simple connection with lattice point enumeration in polytopes. The reciprocity 
law (1) follows readily once the reader is familiar with the computation of the 
coefficients of the Ehrhart polynomial for a lattice polytope. 


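2. COUNTING LATTICE POINTS. Let $\mathbb{Z}^n \subset \mathbb{R}^n$ be the $n$-dimensional integer
lattice, and let $\mathcal{P}$ be an $n$-dimensional lattice polytope in $\mathbb{R}^n$, so $\mathcal{P}$ is a compact
simplicial complex of pure dimension $n$ whose vertices lie on the lattice. For
$t \in \mathbb{N}$, denote by $L(\mathcal{P}, t)$ the number of lattice points in the closure of the dilated
polytope $t\mathcal{P} := \{tx : x \in \mathcal{P}\}$. Ehrhart proved that $L(\mathcal{P}, t)$ is a polynomial in $t$ of
degree $n$ [6]. Moreover,

$$L(\mathcal{P}, t) = \mathrm{Vol}(\mathcal{P})\,t^n + \tfrac12 \mathrm{Vol}(\partial\mathcal{P})\,t^{n-1} + \cdots + \chi(\mathcal{P}).$$

Here, $\mathrm{Vol}(\partial\mathcal{P})$ denotes the surface area of $\mathcal{P}$ normalized with respect to the
sublattice on each face of $\mathcal{P}$, and $\chi(\mathcal{P})$ is the Euler characteristic of $\mathcal{P}$. We note
that, for convex polytopes $\mathcal{P}$, $\chi(\mathcal{P}) = 1$ [6].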




In this paper, we focus on the case $\mathbb{R}^2$, where Ehrhart’s result is known as Pick’s
Theorem; see [7] or [4]: For a convex lattice polytope $\mathcal{P} \subset \mathbb{R}^2$,

$$L(\mathcal{P}, t) = At^2 + \tfrac12 Bt + 1,$$

where $A$ is the area and $B$ is the number of boundary lattice points of $\mathcal{P}$.
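
For the right triangle with vertices $(0, 0)$, $(a, 0)$, $(0, b)$, which is the two-dimensional case of the simplex considered below, this formula can be checked against a direct count. The sketch is an illustration only; the function names are assumptions of this example. It uses $A = ab/2$ and $B = a + b + \gcd(a, b)$.

```python
from fractions import Fraction
from math import gcd


def count_points(a: int, b: int, t: int) -> int:
    """Count lattice points (x, y) with x, y >= 0 and b*x + a*y <= t*a*b."""
    total = 0
    for x in range(t * a + 1):
        # For fixed x, y ranges over 0, 1, ..., floor((t*a*b - b*x) / a).
        total += (t * a * b - b * x) // a + 1
    return total


def pick_count(a: int, b: int, t: int) -> Fraction:
    """Pick's formula A t^2 + (B/2) t + 1 for the triangle (0,0), (a,0), (0,b)."""
    area = Fraction(a * b, 2)
    boundary = a + b + gcd(a, b)
    return area * t * t + Fraction(boundary, 2) * t + 1


if __name__ == "__main__":
    for a, b in [(2, 3), (3, 5), (4, 7)]:
        print(a, b, [count_points(a, b, t) for t in range(1, 5)],
              [int(pick_count(a, b, t)) for t in range(1, 5)])
```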

In the general case, the other coefficients of $L(\mathcal{P}, t)$ are not as easily accessible.
In fact, until quite recently a method of computing these coefficients was unknown.
There has been recent progress in this direction ([1], [2], [10], and [11]);
Diaz and Robins found a way of proving a cotangent representation for the
generating function $\sum_{t=0}^{\infty} L(\mathcal{P}, t)e^{-2\pi st}$, thereby deriving a formula for the Ehrhart
coefficients of $L(\mathcal{P}, t)$ [5]. For our purposes, the following result (a straightforward
consequence of [5, Corollary 1]) is sufficient:


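Theorem. Let $\mathcal{P}$ denote the simplex in $\mathbb{R}^n$ with the vertices $(0, \ldots, 0)$, $(a_1, 0, \ldots, 0)$,
$(0, a_2, 0, \ldots, 0), \ldots, (0, \ldots, 0, a_n)$, where $a_1, \ldots, a_n \in \mathbb{N}$ are pairwise coprime. Denote
the corresponding Ehrhart polynomial by $L(\mathcal{P}, t) = \sum_{j=0}^{n} c_j t^j$. Then $c_m$ is the
coefficient of $s^{-(m+1)}$ in the Laurent expansion at $s = 0$ of

$$\frac{\pi^{m+1}}{m!\,2^{n-m}\,p} \sum_{r=1}^{p} \Bigl(1 + \coth\frac{\pi}{a_1}(s + ir)\Bigr)\Bigl(1 + \coth\frac{\pi}{a_2}(s + ir)\Bigr) \cdots \Bigl(1 + \coth\frac{\pi}{a_n}(s + ir)\Bigr)\Bigl(1 + \coth\frac{\pi}{p}(s + ir)\Bigr),$$

where $p = a_1 \cdots a_n$.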

The appearance of cotangent products in this result leads us to expect Dedekind
sums in some form within the coefficients of the Ehrhart polynomial, thus also
within the formulas for the number of lattice points in simplices. In fact, the
nontrivial cases of dimension three [15] and four [18] involve classical Dedekind
sums. Both formulas can be obtained easily through the Theorem.

We use this result in an indirect way. Precisely, we compute $c_0$ according to the
Theorem, and make use of the fact that $c_0 = \chi(\mathcal{P}) = 1$. Dedekind’s reciprocity
law (1) follows from this idea if we consider the case of dimension $n = 2$.


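3. PROOF OF THE RECIPROCITY LAW. According to the Theorem, for
coprime $a$ and $b$ we have to find the coefficient of $s^{-1}$ of the Laurent series at
$s = 0$ of

$$\frac{\pi}{4ab} \sum_{r=1}^{ab} \Bigl(1 + \coth\frac{\pi}{a}(s + ir)\Bigr)\Bigl(1 + \coth\frac{\pi}{b}(s + ir)\Bigr)\Bigl(1 + \coth\frac{\pi}{ab}(s + ir)\Bigr). \qquad (2)$$

The Laurent expansion of each factor depends on $r$:

$$1 + \coth\frac{\pi}{c}(s + ir) = \begin{cases} S_c = \dfrac{c}{\pi}s^{-1} + 1 + \dfrac{\pi}{3c}s + O(s^3) & \text{if } c \mid r, \\[2mm] R_c = 1 + \coth\dfrac{\pi i r}{c} + O(s) & \text{if } c \nmid r. \end{cases}$$

To keep track of the various cases, we introduce the notation

$$\chi_c = \begin{cases} 1 & \text{if } c \mid r, \\ 0 & \text{if } c \nmid r, \end{cases}$$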




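so that we can write $1 + \coth \pi(s + ir)/c = S_c\chi_c + R_c(1 - \chi_c)$, and (2) becomes

$$\frac{\pi}{4ab} \sum_{r=1}^{ab} \bigl(S_a\chi_a + R_a(1 - \chi_a)\bigr)\bigl(S_b\chi_b + R_b(1 - \chi_b)\bigr)\bigl(S_{ab}\chi_{ab} + R_{ab}(1 - \chi_{ab})\bigr).$$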


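Now, expand this into all 8 terms, and consider each summand according to the
number of $S_c$ factors:


1. Terms with one $S_c$ factor are
$$S_a\chi_a R_b(1 - \chi_b)R_{ab}(1 - \chi_{ab}) = S_aR_bR_{ab}\,\chi_a(1 - \chi_b - \chi_{ab} + \chi_{ab}) = S_aR_bR_{ab}(\chi_a - \chi_{ab}) \qquad (3)$$
and, similarly,
$$R_a(1 - \chi_a)S_b\chi_bR_{ab}(1 - \chi_{ab}) = R_aS_bR_{ab}(\chi_b - \chi_{ab}). \qquad (4)$$
The summand with $S_{ab}$ is zero (note that $\chi_a\chi_{ab} = \chi_b\chi_{ab} = \chi_{ab}$, and
$\chi_a\chi_b = \chi_{ab}$). To compute the contribution of (3), note that the support
of $\chi_a - \chi_{ab}$ in $\{1, \ldots, ab\}$ is $\{ka : 1 \le k \le b - 1\}$; thus its contribution to
(2) is
$$\begin{aligned}
\frac{\pi}{4ab} \sum_{k=1}^{b-1} \frac{a}{\pi}\Bigl(1 + \coth\frac{\pi i k a}{b}\Bigr)\Bigl(1 + \coth\frac{\pi i k a}{ab}\Bigr)
&= \frac{1}{4b} \sum_{k=1}^{b-1} \Bigl(1 - i\cot\frac{\pi k a}{b}\Bigr)\Bigl(1 - i\cot\frac{\pi k}{b}\Bigr) \\
&= -\frac{1}{4b} \sum_{k=1}^{b-1} \cot\frac{\pi k a}{b}\cot\frac{\pi k}{b} + \frac{1}{4b}(b - 1) = \frac14 - \frac{1}{4b} - \mathfrak{s}(a, b).
\end{aligned}$$
The imaginary part in the preceding sum has to be zero, because the
original generating function is real. Similarly, (4) gives a contribution of
$\frac14 - \frac{1}{4a} - \mathfrak{s}(b, a)$.
2. There are no terms with two $S_c$ factors, because
$$S_a\chi_aS_b\chi_bR_{ab}(1 - \chi_{ab}) = S_aS_bR_{ab}\,\chi_{ab}(1 - \chi_{ab}) = 0$$
and
$$S_a\chi_aR_b(1 - \chi_b)S_{ab}\chi_{ab} = S_aR_bS_{ab}\,\chi_{ab}(1 - \chi_b) = 0.$$
3. Finally, the term $S_a\chi_aS_b\chi_bS_{ab}\chi_{ab} = S_aS_bS_{ab}\chi_{ab}$ has support $\{ab\}$, and
gives a contribution of
$$\frac{\pi}{4ab}\left(\frac{a}{\pi}\frac{b}{\pi}\frac{\pi}{3ab} + \frac{a}{\pi}\frac{ab}{\pi}\frac{\pi}{3b} + \frac{b}{\pi}\frac{ab}{\pi}\frac{\pi}{3a} + \frac{a}{\pi} + \frac{b}{\pi} + \frac{ab}{\pi}\right)
= \frac{1}{12}\left(\frac{1}{ab} + \frac{a}{b} + \frac{b}{a}\right) + \frac14\left(\frac{1}{b} + \frac{1}{a} + 1\right).$$


Adding all contributions, we arrive at

$$1 = c_0 = \frac34 + \frac{1}{12}\left(\frac{a}{b} + \frac{1}{ab} + \frac{b}{a}\right) - \mathfrak{s}(a, b) - \mathfrak{s}(b, a),$$

the desired reciprocity law (1).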


The same method applied to dimension n = 3 does not give any further results. 
However, for n = 4, higher dimensional Dedekind sums [20] appear within the 
computations, so that this case is likely to provide new results. 


ACKNOWLEDGMENT. I thank Sinai Robins for helpful suggestions and invaluable support. 




REFERENCES 


1. A. I. Barvinok, Computing the Ehrhart polynomial of a convex lattice polytope, Discrete Comput. Geom. 12 (1994) 35-48.
2. M. Brion and M. Vergne, Lattice points in simple polytopes, J. Amer. Math. Soc. 10 (1997) 371-392.
3. R. Dedekind, Erläuterungen zu den Fragmenten XXVIII, in Collected Works of Bernhard Riemann, Dover Publ., New York, 1953, pp. 466-478.
4. R. Diaz and S. Robins, Pick’s formula via the Weierstrass ℘-function, Amer. Math. Monthly 102 (1995) 431-437.
5. R. Diaz and S. Robins, The Ehrhart polynomial of a lattice polytope, Annals of Math. 145 (1997) 503-518.
6. E. Ehrhart, Sur un problème de géométrie diophantienne linéaire II, J. reine angewandte Math. 227 (1967) 25-49.
7. B. Grünbaum and G. C. Shephard, Pick’s Theorem, Amer. Math. Monthly 100 (1993) 150-161.
8. D. Hickerson, Continued fractions and density results for Dedekind sums, J. reine angewandte Math. 290 (1977) 113-116.
9. F. Hirzebruch and D. Zagier, The Atiyah-Singer index theorem and elementary number theory, Publish or Perish Press, Boston, 1974.
10. J. M. Kantor and A. G. Khovanskii, Une application du Théorème de Riemann-Roch combinatoire au polynôme d’Ehrhart des polytopes entiers de $\mathbb{R}^n$, C. R. Acad. Sci. Paris Series I 317 (1993) 501-507.
11. A. G. Khovanskii and A. V. Pukhlikov, The Riemann-Roch theorem for integrals and sums of quasipolynomials on virtual polytopes, St. Petersburg Math. J. 4 (1993) 789-812.
12. D. Knuth, The art of computer programming, Vol. 2, Addison-Wesley, Reading, Mass., 1981.
13. C. Meyer, Über einige Anwendungen Dedekindscher Summen, J. reine angewandte Math. 198 (1957) 143-203.
14. J. Pommersheim, Toric varieties, lattice points, and Dedekind sums, Math. Ann. 295 (1993) 1-24.
15. H. Rademacher, On Dedekind sums and lattice points in a tetrahedron, Collected papers of Hans Rademacher, MIT Press, 1974, pp. 391-398.
16. H. Rademacher and E. Grosswald, Dedekind sums, Carus Mathematical Monographs, The Mathematical Association of America, 1972.
17. H. Rademacher and A. Whiteman, Theorems on Dedekind sums, Amer. J. Math. 63 (1941) 377-407.
18. K. Rosen, Lattice points in four-dimensional tetrahedra and a conjecture of Rademacher, J. reine angewandte Math. 307/308 (1979) 264-275.
19. I. Vardi, The distribution of Dedekind sums, preprint, 1992.
20. D. Zagier, Higher dimensional Dedekind sums, Math. Ann. 202 (1973) 149-172.


Author's comment: In the course of proofreading, it was discovered that Pommersheim made an 
observation in his paper [14] similar to our idea of equating the Euler characteristic with the given 
cotangent Laurent expansion. His approach used toric varieties but translates into an equivalent 
statement. 


Temple University, Philadelphia, PA 19122 
matthias@euclid.math.temple.edu




THE EVOLUTION OF... 


Edited by Abe Shenitzer 
Mathematics, York University, North York, Ontario M3J 1P3, Canada 


Riemann’s Dissertation and Its Effect 
on the Evolution of Mathematics 


Detlef Laugwitz 


Translated from the German by Abe Shenitzer¹


A short account of the contents of the dissertation. Riemann’s doctoral dissertation
of 1851 is titled Grundlagen für eine allgemeine Theorie der Functionen einer
veränderlichen complexen Grösse (Foundations for a general theory of functions of a
variable complex quantity) [1, 3-43]. It is of modest size. In discussing it we use
modern terms.

Riemann defines holomorphic functions as complex single-valued functions on 
Riemann surfaces satisfying the Cauchy-Riemann differential equations. Riemann 
also worked with functions that were holomorphic except for finite poles in C. 
Such meromorphic functions are viewed as conformal mappings between two 
Riemann surfaces. We must always think of the complex plane as extended by the 
addition of the point ∞ (as the Riemann complex number sphere or as a complex
projective straight line). 

Functions must be thought of not as given by expressions but as determined (to 
within arbitrary constants) by the positions and nature of their singularities. This leads 
to the question of the construction of functions with prescribed properties on a 
given Riemann surface. Here the topology of the surface is of decisive importance. 
The surface T is decomposed by means of n crosscuts into a system of m simply
connected surface pieces. The number n — m, which is independent of the manner 
of decomposition, is called the order of connectivity of T [1, 10-11]; incidentally, 
in modern terms, this number is equal to the negative of the Euler characteristic 
of T. 

In order to construct appropriate functions on T, Riemann uses a variational
principle. (He called it later the Dirichlet principle because he came to know 
similar procedures in Dirichlet’s lectures, and the historically unjustified name 
stuck.) First T is made into a simply connected surface T* by means of crosscuts. 
Then, subject to suitable boundary conditions, the integral 


$$\iint \left[ (u_x - v_y)^2 + (u_y + v_x)^2 \right] dx\,dy$$


" Translator’s note. Reprinted from “Bernhard Riemann 1826-1866: Turning Points in the Conception 
of Mathematics,” by Detlef Laugwitz, Translated by Abe Shenitzer. Copyright 1999 Birkhäuser. This
article is an excerpt (Section 1.2.2, pp. 108-110 and Section 1.2.5, pp. 124-130) from the author’s book
Bernhard Riemann, published by Birkhäuser Verlag in 1996. References such as Article 20 or §20 are to
sections of Riemann’s dissertation. 




is minimized on this surface. If there are singularities to be taken into considera- 
tion, then the integral is somewhat modified. With the possible exception of the 
boundary of T*, the pair of functions u,v associated with the minimum is a 
holomorphic function f = u + iv. It should be noted that the functional values on 
the two edges of a crosscut need not coincide; jumps (“periods”) may occur. 

The paper ends with an application of these methods to the Riemann Mapping 
Theorem. This theorem asserts that in certain cases the topological equivalence of 
two surfaces or regions implies their conformal equivalence, i.e., the existence of a 
conformal mapping between them. Here the theorem is first stated for regions in 
the complex plane that are homeomorphic to a circular disk. 

We will examine the individual key words while considering further
ments in the work of Riemann and others. 

We explain briefly, in modern terms, the form of inference Riemann learned
from Dirichlet. Let $I(\varphi, \psi)$ be the integral of $\varphi_x\psi_x + \varphi_y\psi_y$ over a region $G$ and let
$J(\varphi) = I(\varphi, \varphi)$. Let $\eta$ be a function that vanishes on the boundary $\partial G$ of $G$.

$$J(\varphi + t\eta) = J(\varphi) + 2tI(\varphi, \eta) + t^2J(\eta)$$

implies that if $J(\varphi) \le J(\varphi + t\eta)$ is to hold for all $t$, then we must have $I(\varphi, \eta) = 0$.
Put $\Delta\varphi = \varphi_{xx} + \varphi_{yy}$. Our last result, the vanishing of $\eta$ on $\partial G$, and the Gauss
integral formula (Gauss’ theorem) imply that

$$0 = \int_{\partial G} (\varphi_x\eta\,dy - \varphi_y\eta\,dx) = \int_G (\Delta\varphi)\eta\,dF + I(\varphi, \eta) = \int_G (\Delta\varphi)\eta\,dF.$$

Since this holds for every $\eta$, it follows that $\Delta\varphi = 0$. In other words, a function that
minimizes $J(\varphi)$ is a solution of $\Delta\varphi = 0$. To be sure, the argument does not prove
the existence of such a function, and this elicited justified criticism.

It is relatively easy to prove the uniqueness of the solution of the boundary-value
problem. If $\psi$ were another solution, then $\eta = \varphi - \psi$ would vanish on $\partial G$.
Moreover,

$$J(\varphi) = J(\psi) + 2I(\psi, \eta) + J(\eta)$$

and

$$I(\psi, \eta) = \int_{\partial G} \eta(\psi_x\,dy - \psi_y\,dx) - \int_G (\Delta\psi)\eta\,dF = 0.$$

But then

$$J(\varphi) = J(\psi) + J(\eta) \ge J(\psi).$$

In view of the minimality of $J(\varphi)$, the inequality sign in the last expression must be
replaced by an equality sign. But then $J(\eta) = 0$, i.e., $\eta_x = \eta_y = 0$. Since $\eta = 0$ on
$\partial G$, it follows that $\eta = 0$, and therefore $\psi = \varphi$ throughout $G$.


The effect of the dissertation. Today we are inclined to regard Riemann’s disserta- 
tion as one of the most important achievements of 19th-century mathematics, but 
its immediate effect was rather slight. We saw that in the second part of Article 20 
Riemann himself emphasized just one principle, namely the determination of a 
function by as few data as possible and the elimination of expressions as defini- 
tions of functions. Given its vague formulation, this principle must have struck his 
contemporaries as neither new nor interesting. Riemann was as restrained in his 
statement as he was in the specification of his sources. 

The first person who had to read the paper carefully was the referee for the 
Göttingen faculty, that is, Gauss. His report read as follows: “The paper submitted
by Herr Riemann is a concise testimony to its author’s thorough and penetrating 
studies of the area to which the subject treated therein belongs; of a diligent and 




ambitious, truly mathematical spirit of investigation, and of praiseworthy and 
fertile independence. The report is prudent and concise, and in places even 
elegant; nevertheless, most readers might well wish for even greater transparency 
of arrangement in some of the parts. Taken in its entirety, it is a solid and valuable 
work which not only meets the requirements usually set for test papers for the 
attainment of the doctorate but exceeds them by far.” 




Figure 1. Gauss’ testimonial on Riemann’s dissertation 


If one has a certain amount of experience with evaluations and forgets for a 
moment that here the princeps mathematicorum is writing about a person destined 
to become probably the most distinguished of his students, then one gets the 
following impression. The referee recognizes that the author has penetrated deep 
into a highly specialized field and has done this with great diligence, indepen- 
dently, and without the referee having to suggest the topic to him. There is no 
mention of the author’s new ideas, of the solution of problems, or of new methods, 
but it is recognized that he may well be showing signs of independent research 
activity. The presentation is terse, elegant only in spots, and on the whole not clear 
enough. An objective reader must wonder what was the basis for the “Doktorvater’s”
(doctoral adviser’s) very positive overall evaluation stated in the last sentence.
Riemann wrote to his brother: 


When I visited Gauss he had not yet read my paper, but he told me that for years he had been
preparing a paper (and is occupied with this right now) whose subject is the same, or partly the
same, as the one I am treating.




(Incidentally, this passage was quoted by Schering in his memorial address in 1866 
[2, 835].) So far, no one has been able to find any indication that Gauss had 
discussed with Riemann the contents of his paper or had given him any hints or 
suggestions. Riemann would have reported such things. After all, he mentioned 
the rather disappointing conversation with Gauss which comes down, more or less, 
to this: right now I happen to be writing on a related topic, but your paper has not 
interested me enough that I should immediately and eagerly plunge into it. 

Some (e.g., Remmert [6, Band 2, 158]) think that the old Gauss was “chary of
praise” (“lobkarg”). But what argues against this is the fact that a few years earlier
he had praised young Eisenstein to the skies. We will make no guesses about the 
great Gauss’ admittedly baffling behavior toward Riemann. 

We summarize the essential mathematical concerns that originated in Riemann’s 
dissertation. 


(1) The idea of a Riemann surface. Here, for the first time, the domain of 
definition of a function becomes one of the data that determine it. The complex 
plane is compactified by the addition of a single point ∞, the Riemann surfaces
over it are precisely defined, the connectivity number is introduced and recognized 
as a topological invariant. (Complex) analysis is carried out not locally but on 
manifolds, which are compact in the case of algebraic functions. Local repre- 
sentability (by power series) is proved but is of secondary importance. 

(2) In addition to poles, branch points are recognized as characteristic types of 
singularities, and the local series expansions in terms of (negative or fractional) 
powers are rigorously justified (Article 13/14, [1, 24-27]). 

(3) The existence (together with the continuity) of f’(z) is equivalent to the 
Cauchy—Riemann differential equations (together with the continuity of the occur- 
ring partial derivatives) and to the conformal character of f. It is also equivalent to 
the local expandability, which implies the existence of all derivatives. (Holomorphic
or analytic functions.)

(4) The transformation of surface integrals into line integrals is a tool for 
proving theorems (Articles 7-12, [1, 12—24]) of the “Cauchy type.” 

(5) The (“Dirichlet”) principle of the existence of a function that minimizes a 
surface integral is used to solve boundary-value problems by means of holomorphic 
functions. 

(6) The Riemann Mapping Theorem is a consequence of (5). 


The response of contemporaries was amazingly slight; hardly any of the more 
than 500 titles in Purkert’s list covering the period from 1851 to 1891 ([2, 869-895]) 
and relevant to Riemann’s dissertation appeared before his death. This is all the 
more surprising if we keep in mind that two of Riemann’s papers that presented 
the ideas of his dissertation in greater detail and applied them to the solution of 
problems appeared in 1857. Things were no different when it came to textbooks.
For example, Heinrich Weber’s Elliptische Functionen of 1891 contains nothing
relating to Riemann. Thus one can hardly speak of a significant impact of 
Riemann’s ideas during his lifetime and in the first 25 years after his death. In the 
subsequent sections we will examine the question of the very special directions in 
which Riemann influenced research and the question of which elements of his 
essential ideas failed initially to attract attention. 

Let us return to the year of the composition of the dissertation. Jacobi died on 
18 February 1851. Dirichlet pushed Riemann in another direction, which led to his 
habilitation paper on trigonometric series. Representatives of the algorithmic
direction could hardly be expected to approve of Riemann’s dissertation. Eisenstein
died on 11 October 1852 and Weierstrass had not yet appeared on the scene.
The French mathematicians, whose contributions were not explicitly acknowledged 
in the dissertation, could at best be expected to recognize the concept of a 
Riemann surface as new. At the same time, they viewed it as too complicated and 
superfluous. Moreover, Cauchy’s students soon got used to working with complex 
functions in the complex plane in much the same way as Cauchy, who had used 
complex formulations for his integral theorems and for his method of residues as 
early as 1831. They must have regarded the method of real partial differential 
equations as a backward step. At the time doubly periodic functions were in 
fashion, and they could be dealt with without the use of Riemann surfaces. 

Of course, in time the six previously listed key issues associated with the 
dissertation exerted a powerful effect. What follows is a survey describing 
this effect. 

The effect of (6) was later especially notable in applied mathematics. For a disk, 
the first boundary-value problem for the potential equation $u_{xx} + u_{yy} = 0$ is
solved by the Poisson integral, which expresses the function u in terms of its 
boundary values. Since the differential equation is invariant under conformal 
mappings, we obtain a solution of this problem for any simply connected region 
bounded by a curve by mapping the disk conformally onto this region. But this is 
just an existence statement, and Riemann’s theorem does not directly yield a 
formula representing the solution. Such representations were eventually obtained 
for regions of practical importance by H. A. Schwarz, E. B. Christoffel, and others. 
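
The role of the Poisson integral on the disk can be illustrated numerically. The following Python sketch is ours and not part of Riemann's or Laugwitz's text: the function name `poisson_integral`, the sample count, and the use of an equally spaced quadrature rule are all assumptions made for illustration. It evaluates $u(r, \theta) = \frac{1}{2\pi}\int_0^{2\pi} \frac{1 - r^2}{1 - 2r\cos(\theta - \varphi) + r^2}\, f(\varphi)\, d\varphi$ for boundary values f on the unit circle.

```python
from math import cos, pi


def poisson_integral(boundary, r, theta, samples=2000):
    """Approximate the Poisson integral on the unit disk at the point (r, theta).

    `boundary` is a function of the angle phi giving the prescribed boundary
    values; 0 <= r < 1 is assumed. The integral over [0, 2*pi] is approximated
    by an equally spaced sum (an illustrative choice).
    """
    total = 0.0
    for j in range(samples):
        phi = 2 * pi * j / samples
        kernel = (1 - r * r) / (1 - 2 * r * cos(theta - phi) + r * r)
        total += kernel * boundary(phi)
    # (1 / (2*pi)) * (2*pi / samples) * sum  ==  sum / samples
    return total / samples


if __name__ == "__main__":
    # Boundary values cos(2*phi) belong to the harmonic function r^2 cos(2*theta).
    approx = poisson_integral(lambda phi: cos(2 * phi), 0.5, 0.3)
    print(approx, 0.25 * cos(0.6))
```

The comparison function $r^2\cos 2\theta$ is the real part of $z^2$, so it solves the potential equation with the stated boundary values.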

The mapping theorem became effective in many respects independently of 
applications and of the other objectives and contents of the dissertation. It is an 
instance of Riemann’s novel view of mathematics. For one thing, it illustrates the 
fruitfulness of the notion that functions are simply mappings. For another, it is a 
global proposition; all Gauss could prove was the conformal equivalence of small 
pieces of surfaces. Finally it was one of the deeper existence theorems to emerge 
after Cauchy’s existence theorems about solutions of differential equations. For 
adherents of algorithms this was an unusual type of proposition; indeed, they took 
note of transformations only if they were associated with effective formulas. It is 
also noteworthy that the theorem shows that the theory of functions on a simply 
connected region with boundary is completely independent of the special choice of 
region. When investigating a special class of functions we can choose a convenient 
special region, say the upper halfplane. 

Riemann’s sketch of a proof in §21 is cryptic, and not just because of his use of 
the Dirichlet principle. Efforts to fully justify the idea of his proof failed. Given the 
importance of the theorem for applications, this failure stimulated attempts to 
develop new methods of proof. These remarks also apply to the uniformization 
theorem, which generalizes Riemann’s mapping theorem. The geometric formula- 
tion promoted the acceptance of the notion of a Riemann surface. Riemann 
himself spoke [1, 40] of “geometric clothing” (“geometrische Einkleidung’’) used 
for “illustration and more convenient wording” (zur “Veranschaulichung und 
bequemeren Fassung”), formulations hardly ever encountered elsewhere in his 
writings. The use of complex methods for the computation of definite integrals 
opened up a new field for the applicability of complex function theory, and that is 
why complex analysis became a fixed component of the mathematical education of 
physicists and engineers. As for mathematics itself, the question of admissible 
boundaries of simply connected regions provided essential impulses for the evolu- 
tion of point set theory. 




For the effects of the dissertation in the first fifty years after Riemann, see [5]. 
For later developments see [6, Band 2, 157-163]. We recommend [3] and especially 
[4], a book saturated with Riemann’s style of thinking. It is safe to say that, even 
had Riemann’s dissertation consisted of just the mapping theorem, its influence 
would ultimately have been considerable. 

The effect of (5) was unexpected. Riemann’s justification of the existence of a 
minimal solution is inadequate. This was noted by Weierstrass, whose 1870 
criticism was devastating and seemed to destroy the very basis of Riemann’s 
justification of complex analysis. But this had also very positive consequences. 

One consequence was that people tried, successfully, to prove the relevant 
results without using the Dirichlet principle. Actually they would have tried to find 
such proofs regardless of doubts about this principle. Such attempts reflect the 
wish to construct complex function theory in a “purely complex” way and to avoid 
the use of tools from real analysis, functions u and v of two real variables x and y. 
This too was achieved. Incidentally, this does not signify the rejection of Riemann’s 
development of function theory. In view of its conceptual basis, it is closer to our 
way of thinking than is, say, the Weierstrass approach. 

Another consequence of the criticism directed at Riemann’s justification of the 
Dirichlet principle was even more important than the first one. Since there were 
no counterexamples and the principle itself was believable, people felt that it must 
be provable. Hilbert obtained a proof after 1900, and in doing so developed the 
so-called direct methods of the calculus of variations, which avoid the detour 
through the partial differential equations associated with the variational problem. 
One begins instead with a sequence of functions for which the values of the 
integral, or more generally of the functional, to be minimized approximate the 
infimum. One must show that the space of admissible functions has a compactness 
property which justifies the conclusion that a subsequence converges to a function
for which the functional takes on its minimum. In this way a method was 
developed that not only saved the Dirichlet principle but has progressively become 
more important in the 20th century. 

But let us go back briefly to the attempts to avoid the Dirichlet principle. Much 
was achieved by H. A. Schwarz and C. Neumann. As for the mapping theorem, the 
conclusive result was obtained independently by Poincaré and by Koebe in 1907. It 
asserts that every simply connected Riemann surface is holomorphically equivalent 
to one of the following three surfaces: C ∪ {∞} (the number sphere or complex
projective straight line), C (the number plane or complex straight line), or the open 
disk |z| < 1. The key that leads one to this group of problems in the literature is 
the uniformization theorem. This problem and its easy-to-formulate answer were 
almost obvious to Riemann, but half a century was needed to obtain it. 

We do not know whether Riemann expected a stronger response. After all, he 
did say:


However, we now refrain from the realization of this theory...for we rule out, at present, 
consideration of an expression of a function 


He set aside for a few years the task of investigating concrete functions and classes 
of functions, and tackled it in connection with lectures devoted to these matters. 
Of course, this did not happen during his first year as university instructor. 




REFERENCES 


1. (W.) Bernhard Riemann’s gesammelte mathematische Werke und wissenschaftlicher Nachlass.
Herausgegeben unter Mitwirkung von R. Dedekind von H. Weber. 2. Auflage: Teubner, Leipzig,
1892. Reprint: Dover, New York, 1953.

2. (N.) Bernhard Riemann, Gesammelte mathematische Werke, wissenschaftlicher Nachlass und Nachträge.
Collected Papers. Nach der Ausgabe von H. Weber und R. Dedekind neu herausgegeben von R.
Narasimhan. Springer/Teubner, Berlin/Leipzig, 1990.

3. L. V. Ahlfors, “Development of the theory of conformal mapping and Riemann surfaces through a
century.” In Contributions to the theory of Riemann surfaces. Centennial celebration of Riemann’s
dissertation. Annals of Mathematics Studies, No. 30, 3-13, Princeton, 1953.

4. R. Courant, Dirichlet’s principle, conformal mapping and minimal surfaces, with an appendix by
M. Schiffer. Interscience, New York/London, 1950.

5. J. Gray, “On the history of the Riemann mapping theorem.” Studies in the history of mathematics,
I. Supplemento ai Rendiconti del Circolo Matematico di Palermo, ser. II, no. 34, 47-94, 1994.

6. R. Remmert, Funktionentheorie I, II. Springer, Berlin, 1991.


Snow and Ice and Numbers 


It seems necessary to explain my claustrophobia to him. 

“Do you know what the foundation of mathematics is?” I ask. “The foundation of 
mathematics is numbers. If anyone asked me what makes me truly happy, I would say: 
numbers. Snow and ice and numbers. And do you know why?” 

He splits the claws with a nutcracker and pulls out the meat with curved tweezers. 

“Because the number system is like human life. First you have the natural numbers. 
The ones that are whole and positive. The numbers of a small child. But human 
consciousness expands. The child discovers a sense of longing, and do you know what 
the mathematical expression is for longing?” 

He adds cream and several drops of orange juice to the soup. 

“The negative numbers. The formalization of the feeling that you are missing 
something. And human consciousness expands and grows even more, and the child 
discovers the in between spaces. Between stones, between pieces of moss on the stones, 
between people. And between numbers. And do you know what that leads to? It leads 
to fractions. Whole numbers plus fractions produce rational numbers. And human 
consciousness doesn’t stop there. It wants to go beyond reason. It adds an operation as 
absurd as the extraction of roots. And produces irrational numbers.” 

He warms French bread in the oven and fills the pepper mill. 

“It’s a form of madness. Because the irrational numbers are infinite. They can’t be 
written down. They force human consciousness out beyond the limits. And by adding 
irrational numbers to rational numbers, you get real numbers.” 

I’ve stepped into the middle of the room to have more space. It’s rare that you have a 
chance to explain yourself to a fellow human being. Usually you have to fight for the 
floor. And this is important to me. 

“It doesn’t stop. It never stops. Because now, on the spot, we expand the real 
numbers with imaginary square roots of negative numbers. There are numbers we can’t 
picture, numbers that normal human consciousness cannot comprehend. And when we 
add the imaginary numbers to the real numbers, we have the complex number system. 
The first number system in which it’s possible to explain satisfactorily the crystal
formation of ice. It’s like a vast, open landscape. The horizons. You head toward them 
and they keep receding. That is Greenland, and that’s what I can’t be without! That’s 
why I don’t want to be locked up.” 


Smilla’s Sense of Snow, by Peter Høeg, translated by Tiina Nunnally
Dell Publishing, New York, 1994, pp. 121-122 


Contributed by Evan J. Romer, Windsor, NY 




PROBLEMS AND SOLUTIONS 


Edited by Gerald A. Edgar, Daniel H. Ullman, and Douglas B. West 


with the collaboration of Paul T. Bateman, Mario Benedicty, Paul Bracken, Duane M. Broline, Ezra 
A. Brown, Richard T. Bumby, Glenn G. Chappell, Randall Dougherty, Roger B. Eggleton, Ira M. 
Gessel, Bart Goddard, Jerrold R. Griggs, Douglas A. Hensley, Richard Holzsager, John R. Isbell, 
Robert Israel, Kiran S. Kedlaya, Murray S. Klamkin, Fred Kochman, Frederick W. Luttmann, Vania 
Mascioni, Frank B. Miles, Richard Pfiefer, Cecil C. Rousseau, Leonard Smiley, John Henry Steelman, 
Kenneth Stolarsky, Richard Stong, Charles Vanden Eynden, and William E. Watkins. 


Proposed problems and solutions should be sent in duplicate to the MONTHLY
problems address on the inside front cover. Submitted problems should include
solutions and relevant references. Submitted solutions should arrive at that
address before October 31, 1999. Additional information, such as generalizations
and references, is welcome. The problem number and the solver’s name and
address should appear on each solution. An acknowledgement will be sent only if a
mailing label is provided. An asterisk (*) after the number of a problem or a part
of a problem indicates that no solution is currently available.


PROBLEMS 


10732. Proposed by M. N. Deshpande, Nagpur, India. Let n and k be positive integers
with k < n. Select a permutation $\pi$ of n objects at random, and let the random variable $X_k$
denote the number of objects that lie in cycles of $\pi$ of length less than or equal to k. Find
the expected value and the variance of $X_k$.


10733. Proposed by Sung Soo Kim, Hanyang University, Ansan, Korea. Let $\{E_\alpha\}_{\alpha \in \Omega}$ be
a partition of the unit interval $I = [0, 1]$ into nonempty sets that are closed in the usual
topology. Is it possible that

(a) $\Omega$ is uncountable and $E_\alpha$ is uncountable for each $\alpha \in \Omega$?

(b) $\Omega$ is uncountable but $E_\alpha$ is countably infinite for each $\alpha \in \Omega$?

(c) $\Omega$ is countably infinite?


10734. Proposed by Floor van Lamoen, Goes, The Netherlands. Let ABC be a triangle
with orthocenter H, incenter I, and circumcenter O. Let [P, r] denote the circle with
center P and radius r. Show that the radical center of [A, CA + AB], [B, AB + BC], and
[C, BC + CA] is the point obtained by reflecting H through O and then reflecting the result
through I.


10735. Proposed by Gustavus J. Simmons, Sandia Park, NM. If $L_n$ is the n-by-n matrix
with i, j-entry equal to $\binom{i-1}{j-1}$, then $L_n^2 \equiv I_n \bmod 2$, where $I_n$ is the n-by-n identity matrix.
Show that if $R_n$ is the n-by-n matrix with i, j-entry equal to $\binom{i-1}{n-j}$, then $R_n^3 \equiv I_n \bmod 2$.


10736. Proposed by Mizan R. Khan, Eastern Connecticut State University, Willimantic, CT.
For a given n > 2, let $M(n) = \max\{\,|a - b| : a, b \in \{1, 2, \ldots, n\} \text{ and } ab \equiv 1 \bmod n\,\}$.
(a) Find a closed-form expression U(n) such that $M(n) \le U(n)$ for all n, with equality in
infinitely many cases.

(b) Show that $\lim_{n \to \infty} M(n)/n = 1$.

(c)* Prove or disprove that $\lim_{n \to \infty} \log(n - M(n))/\log n = 1/2$.
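
Readers who wish to experiment before attempting Problem 10736 can evaluate M(n) by brute force directly from its definition. The Python sketch below is only an illustration and is not part of the proposal: the name `max_inverse_gap` is ours, and the quadratic search is chosen for transparency rather than speed.

```python
def max_inverse_gap(n):
    """Return M(n) = max |a - b| over a, b in {1, ..., n} with a*b = 1 (mod n).

    Intended for n > 2, as in the problem statement. The double loop simply
    tests every pair (a, b); pairs (a, a) with a*a = 1 (mod n) contribute 0.
    """
    best = 0
    for a in range(1, n + 1):
        for b in range(1, n + 1):
            if (a * b) % n == 1:
                best = max(best, abs(a - b))
    return best


if __name__ == "__main__":
    # Tabulate n, M(n), and n - M(n) for a few small moduli.
    for n in range(3, 31):
        m = max_inverse_gap(n)
        print(n, m, n - m)
```

Tabulating n − M(n) against n in this way gives some numerical feeling for parts (b) and (c), although it proves nothing.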




10737. Proposed by Hassan Ali Shah Ali, Tehran, Iran. Let m and n be positive integers
with n > 2m, and let $a_1 < a_2 < \cdots < a_n$ be positive integers such that


re. (s (3:)()) | 


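Show that there exist two different n-tuples $(\epsilon_1, \ldots, \epsilon_n)$ and $(\delta_1, \ldots, \delta_n)$, with entries 0, 1,
and 2, such that $\sum_{j=1}^{n} \epsilon_j = \sum_{j=1}^{n} \delta_j < 2m$ and $\sum_{j=1}^{n} \epsilon_j a_j = \sum_{j=1}^{n} \delta_j a_j$.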

10738. Proposed by Radu Theodorescu, Université Laval, Sainte-Foy, PQ, Canada. For
t > 0, let $m_n(t) = \sum_{k=0}^{\infty} k^n e^{-t} t^k / k!$ be the nth moment of a Poisson distribution with
parameter t. Let $c_n(t) = m_n(t)/n!$. A sequence $a_0, a_1, \ldots$ is log-convex if $a_{n+1}^2 \le a_n a_{n+2}$
for all $n \ge 0$ and is log-concave if $a_{n+1}^2 \ge a_n a_{n+2}$ for all $n \ge 0$.
(a) Show that $m_0(t), m_1(t), \ldots$ is log-convex.

(b) Show that $c_0(t), c_1(t), \ldots$ is not log-concave when t < 1.

(c) Show that $c_0(t), c_1(t), \ldots$ is log-concave when t is sufficiently large.

(d)* Is $c_0(t), c_1(t), \ldots$ log-concave when t > 1?


SOLUTIONS 


Moments of Roots of Chebyshev Polynomials 


10448 [1995, 360]. Proposed by Fu-Chuen Chang, National Sun Yat-sen University, Kaohsiung,
Taiwan. Fix a positive integer n. Let $x_i = \cos((2i - 1)\pi/(2n))$ for $1 \le i \le n$, and
let $c_k = \frac{1}{n}\sum_{i=1}^{n} x_i^k$ for $k \in \mathbb{N}$. Show that
$$c_k = \begin{cases} 0 & \text{if } k = 1, 3, \ldots, 2n - 1; \\ \binom{k}{k/2}\, 2^{-k} & \text{if } k = 0, 2, \ldots, 2n - 2. \end{cases}$$

Solution I by Paul Deiermann, Louisiana State University, Shreveport, LA. When k = 0 and
n is odd, the term for j = (n + 1)/2 appears as $0^0$, which must be taken to be 1 to arrive at
the stated formula and our generalization. We show, for arbitrary integers $k \ge 0$, that
$$c_k = \begin{cases} 0 & \text{for } k \text{ odd}, \\ 2^{-k} \sum_{p=-m}^{m} (-1)^p \binom{k}{k/2 + pn} & \text{for } k \text{ even}, \end{cases}$$
where $m = \lfloor k/(2n) \rfloor$. The stated problem covers those k for which m = 0.

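First note that $x_{n+1-j} = -x_j$, so the terms of the sum cancel in pairs when k is odd. We
may thus restrict to the case of k even. Since $x_j = (e^{i\pi(2j-1)/(2n)} + e^{-i\pi(2j-1)/(2n)})/2$, the
binomial theorem and a summation of a finite geometric progression imply
$$\begin{aligned}
\sum_{j=1}^{n} x_j^k
&= 2^{-k} \sum_{j=1}^{n} \left( e^{i\pi(2j-1)/(2n)} + e^{-i\pi(2j-1)/(2n)} \right)^k
 = 2^{-k} \sum_{j=1}^{n} \sum_{q=0}^{k} \binom{k}{q} e^{\frac{i\pi}{2n}(2j-1)(2q-k)} \\
&= 2^{-k} \sum_{q=0}^{k} \binom{k}{q} e^{-\frac{i\pi}{2n}(2q-k)} \sum_{j=1}^{n} e^{\frac{2\pi i}{n}(q-k/2)j}
 = 2^{-k} \sum_{q=0}^{k} \binom{k}{q} e^{\frac{i\pi}{2n}(2q-k)} \sum_{u=0}^{n-1} e^{\frac{2\pi i}{n}(q-k/2)u} \\
&= 2^{-k} \sum_{q=0}^{k} \binom{k}{q} e^{\frac{i\pi}{2n}(2q-k)} \times
\begin{cases} n & \text{if } q - k/2 = pn,\ p \in \mathbb{Z}, \\[4pt]
\dfrac{1 - e^{2\pi i (q-k/2)}}{1 - e^{\frac{2\pi i}{n}(q-k/2)}} = 0 & \text{if } n \nmid q - k/2. \end{cases}
\end{aligned}$$

Since k is even, $q - k/2 = pn$ implies $q = pn + k/2$. Then $0 \le q \le k$ gives $-m \le p \le m$.
Also, in this case, $e^{\frac{i\pi}{2n}(2q-k)} = e^{i\pi p} = (-1)^p$. Thus, we get
$$\sum_{j=1}^{n} x_j^k = 2^{-k}\, n \sum_{p=-m}^{m} (-1)^p \binom{k}{pn + k/2}.$$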



Solution II by Walter Van Assche, Katholieke Universiteit Leuven, Heverlee, Belgium. The
$x_i$ are the zeros of the Chebyshev polynomial of the first kind $T_n$ of degree n. The
Gauss-Chebyshev quadrature formula has the property that the quadrature weights are constant;
thus Gaussian quadrature gives
$$\frac{1}{n} \sum_{i=1}^{n} f(x_i) = \frac{1}{\pi} \int_{-1}^{1} f(x)\, \frac{dx}{\sqrt{1 - x^2}}$$
for every polynomial f of degree at most 2n − 1 (T. J. Rivlin, Chebyshev Polynomials,
Wiley, 1990, pp. 43-46). Taking $f(x) = x^k$ for $0 \le k \le 2n - 1$ then gives
$$c_k = \frac{1}{\pi} \int_{-1}^{1} x^k\, \frac{dx}{\sqrt{1 - x^2}}.$$
By symmetry this integral vanishes when k is odd. When k is even, the symmetry and the
substitution $x^2 = t$ give
$$\int_{-1}^{1} x^k\, \frac{dx}{\sqrt{1 - x^2}} = \int_{0}^{1} t^{(k-1)/2}\, \frac{dt}{\sqrt{1 - t}}.$$
The latter is Euler’s Beta function $B((k+1)/2,\, 1/2) = \Gamma((k+1)/2)\,\Gamma(1/2)/\Gamma(k/2 + 1)$.
Now use Legendre’s duplication formula $\Gamma(2z) = (2\pi)^{-1/2}\, 2^{2z - 1/2}\, \Gamma(z)\, \Gamma(z + 1/2)$ with
$2z = k + 1$ and $\Gamma(1/2) = \sqrt{\pi}$ to find the desired results.
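
The formulas of Problem 10448 are easy to check numerically. The following Python sketch is an illustration only; the function names `power_mean`, `closed_form`, and `general_form` are ours. It compares the power means $c_k$ of the Chebyshev zeros with the stated value $\binom{k}{k/2} 2^{-k}$ for k < 2n and with the alternating sum of Solution I for larger k.

```python
from math import comb, cos, pi


def power_mean(n, k):
    """c_k = (1/n) * sum of x_i**k over the zeros x_i of the Chebyshev polynomial T_n."""
    zeros = [cos((2 * i - 1) * pi / (2 * n)) for i in range(1, n + 1)]
    return sum(x ** k for x in zeros) / n


def closed_form(k):
    """Value claimed in the problem for 0 <= k <= 2n - 1."""
    return 0.0 if k % 2 else comb(k, k // 2) / 2 ** k


def general_form(n, k):
    """Generalization from Solution I, valid for every integer k >= 0."""
    if k % 2:
        return 0.0
    m = k // (2 * n)
    total = 0
    for p in range(-m, m + 1):
        sign = -1 if p % 2 else 1
        total += sign * comb(k, k // 2 + p * n)
    return total / 2 ** k


if __name__ == "__main__":
    n = 4
    for k in range(0, 3 * n):
        print(k, power_mean(n, k), general_form(n, k))
```

In floating-point arithmetic the odd moments come out as tiny nonzero numbers instead of exact zeros, so comparisons should allow a small tolerance.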


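Solution III by Franz Peherstorfer, Johannes Kepler Universität, Linz, Austria. For $x \in
[-1, 1]$, let $T_n(x) = \cos(n \arccos x)$ and $U_n(x) = \sin((n + 1) \arccos x)/\sin(\arccos x)$
denote the degree n Chebyshev polynomials of the first and second kind, respectively.
Since $T_n(x) = 2^{n-1} \prod_{i=1}^{n} (x - x_i)$ and $T_n'(x) = n\,U_{n-1}(x)$, we have
$$\frac{U_{n-1}(x)}{T_n(x)} = \frac{1}{n} \sum_{i=1}^{n} \frac{1}{x - x_i} = \sum_{k=0}^{\infty} \frac{c_k}{x^{k+1}} \qquad (1)$$
for |x| > 1, where the second equality follows from a series expansion of $(1 - x_i/x)^{-1}$.
On the other hand, we have $T_n^2(x) - (x^2 - 1)\,U_{n-1}^2(x) = 1$ for all $x \in \mathbb{R}$. Dividing both
sides of this equation by $(x^2 - 1)\,T_n^2(x)$ gives
$$\left( \frac{1}{\sqrt{x^2 - 1}} - \frac{U_{n-1}(x)}{T_n(x)} \right) \left( \frac{1}{\sqrt{x^2 - 1}} + \frac{U_{n-1}(x)}{T_n(x)} \right) = \frac{1}{(x^2 - 1)\,T_n^2(x)} = O\!\left( \frac{1}{x^{2n+2}} \right)$$
as $x \to \infty$. Since $\lim_{x \to \infty} x\,U_{n-1}(x)/T_n(x) = 1$, this implies
$$\frac{U_{n-1}(x)}{T_n(x)} = \frac{1}{\sqrt{x^2 - 1}} + O\!\left( \frac{1}{x^{2n+1}} \right) \qquad (2)$$
as $x \to \infty$. Taking the series expansion of $(1 - x^{-2})^{-1/2}$ in (2) and comparing to the series in
(1) gives the desired result.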


Editorial comment. Wolfdieter Lang noted that the generating function $\sum_{k \ge 0} c_k z^k$ has been
computed explicitly as an elementary function. See W. Lang, On sums of powers of zeros 
of polynomials, J. Comp. Appl. Math. 88 (1998) 237-256 for details and further references. 


Solved also by U. Abel (Germany), J. Anglesio (France), G. Bach (Germany), K. L. Bernstein, N. Bhatnagar, J. C. Binz (Switzer- 
land), P. Bracken & S. Dorf (Canada), R. J. Chapman (U. K.), H. Chen, E. Cohen (France), D. A. Darling, K. Diethelm (Germany), 
C. J. Efthimiou, R. Ehrenborg (Canada), S. M. Gagola Jr., M. E. H. Ismail, N. Komanda, R. L. Lamphere, W. Lang (Germany), 
J. H. Lindsey II, O. P. Lossers (The Netherlands), A. Pedersen (Denmark), N. Rosenberg, K. Foltz, H.-J. Seiffert (Germany), S. J. 
Smith (Australia), A. Stenger, R. Stong, M. Vowe (Switzerland), H. Widmer (Switzerland), Anchorage Math Solutions Group, 
NSA Problems Group, and the proposer. 




Indecomposable Numbers 


10589 [1997, 362]. Proposed by Tim Keller, Fair Oaks, CA. Fix $n \ge 3$, and let S be the set
of positive integers congruent to 1 modulo n. A number $m \in S$ is called indecomposable
if it is not the product of two smaller numbers in S. Problem 2 from the 1977 International
Mathematical Olympiad asks for a number that can be expressed as the product of
indecomposable numbers in more than one way. Show that the least such number is the product of
two numbers each of the form k(k + n).


Solution by the GCHQ Problems Group, Cheltenham, U. K. Define a clone to be a number
expressible as a product of indecomposable factors in two different ways. Let m be the
smallest clone. By the minimality of m, no indecomposable factor can appear in both
expressions. Let an + 1 be the smallest indecomposable factor in either expression, and let
bn + 1 = m/(an + 1). Let cn + 1 be an indecomposable factor in the other expression, and
let dn + 1 = m/(cn + 1). Thus m = (an + 1)(bn + 1) = (cn + 1)(dn + 1).

Since cn + 1 is indecomposable, an + 1 does not divide it. Also an + 1 does not divide
dn + 1, since otherwise dn + 1 is a smaller clone than m. Therefore an + 1 is not prime
and factors as pq, where p | (cn + 1) and q | (dn + 1). Both p and q are coprime to n.

Now p | (an + 1) and p | (cn + 1), so p | (c − a)n. Since p is coprime to n, we have
p | (c − a), so c = rp + a, where r ≥ 1 since c > a. Hence cn + 1 = rpn + an + 1 =
rpn + pq = p(rn + q). Similarly, q | (d − a)n leads to dn + 1 = q(sn + p), where s ≥ 1.
Thus m = p(rn + q) q(sn + p).

Finally, we show that r = s = 1. Let t = p(n + q) q(n + p). If r > 1 or s > 1, then
t < m, so t must not be a clone. Since t = pq × (n + p)(n + q) and pq is indecomposable,
pq must divide one of the two factors in the factorization t = p(n + q) × q(n + p). But
if pq | p(n + q), then pq | pn, and q | n, a contradiction since q is coprime to n. An identical
argument shows that pq cannot divide q(n + p).

With r = s = 1, we have m = p(n + p) × q(n + q), as desired.
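
The conclusion can be checked by brute force for small n. The Python sketch below is a hedged illustration, not part of the published solution; all function names are ours. It counts factorizations of members of S into indecomposables (as multisets), finds the least clone, and tests whether it is a product of two numbers of the form k(k + n).

```python
def is_indecomposable(m, n):
    """True if m (in S, m > 1) is not a product of two smaller members of S."""
    d = n + 1
    while d * d <= m:
        if m % d == 0 and (m // d) % n == 1:
            return False
        d += n
    return True


def count_factorizations(m, n, smallest=None):
    """Number of ways to write m as a nondecreasing product of indecomposables >= smallest."""
    if m == 1:
        return 1
    if smallest is None:
        smallest = n + 1
    count = 0
    d = smallest
    while d <= m:
        if m % d == 0 and is_indecomposable(d, n):
            count += count_factorizations(m // d, n, d)
        d += n
    return count


def least_clone(n):
    """Smallest member of S with at least two factorizations into indecomposables."""
    m = n + 1
    while count_factorizations(m, n) < 2:
        m += n
    return m


def is_of_form(v, n):
    """True if v = k * (k + n) for some positive integer k."""
    k = 1
    while k * (k + n) <= v:
        if k * (k + n) == v:
            return True
        k += 1
    return False


def is_product_of_two_forms(m, n):
    """True if m = k1 * (k1 + n) * k2 * (k2 + n) for positive integers k1, k2."""
    k = 1
    while k * (k + n) <= m:
        a = k * (k + n)
        if m % a == 0 and is_of_form(m // a, n):
            return True
        k += 1
    return False


if __name__ == "__main__":
    for n in (3, 4, 5):
        m = least_clone(n)
        print(n, m, is_product_of_two_forms(m, n))
```

For n = 3 the search stops at 100 = 4 · 25 = 10 · 10, which is (2 · 5)(2 · 5) in the notation of the solution.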


Editorial comment. The proposer and the NCCU Problems Group both noted that pq is not
necessarily the smallest composite congruent to 1 modulo n, giving the example n = 336,
where 336k + 1 is prime for $1 \le k \le 3$, $336 \cdot 4 + 1 = 5 \cdot 269$, and $336 \cdot 5 + 1 = 41 \cdot 41$,
but $5 \cdot 269\,(5 + 336)(269 + 336) > 41 \cdot 41\,(41 + 336)(41 + 336)$.


Solved also by X. Wang, NCCU Problems Group, and the proposer.


Negatively Correlated Vectors of Signs


10593 [1997, 456]. Proposed by Donald E. Knuth, Stanford University, Stanford, CA. A
certain matrix has m rows and $n = 1 + k^2$ columns. All entries of the matrix are ±1, and
the dot product of any two columns is less than or equal to 0. Prove that the total number
of positive entries in the matrix is at most $\frac{1}{2} m(n + k)$, and construct a matrix that achieves
this upper bound.


Solution by GCHQ Problem Solving Group, Cheltenham, U. K. Consider the sum S of the
dot products of all pairs of columns. Since each dot product is nonpositive, so is S. If row
i has $r_i$ positive entries, then its contribution to the sum is $\binom{r_i}{2} + \binom{n - r_i}{2} - r_i(n - r_i)$, which
equals $((2r_i - n)^2 - n)/2$.

Substituting $r_i = (n + k + b_i)/2$ leads to
$$S = \frac{1}{2} \sum_{i=1}^{m} \left( (k + b_i)^2 - n \right) = \frac{1}{2} \sum_{i=1}^{m} \left( (k + b_i)^2 - (1 + k^2) \right) = \frac{1}{2} \sum_{i=1}^{m} \left( 2k b_i + b_i^2 - 1 \right).$$
Since $S \le 0$, we obtain
$$2k \sum_{i=1}^{m} b_i \le \sum_{i=1}^{m} (1 - b_i^2).$$
Since $r_i = (1 + k + k^2 + b_i)/2$ and $r_i$ is an integer, $b_i$ must be odd, and so $1 - b_i^2 \le 0$ for all
i. Therefore $\sum_{i=1}^{m} b_i \le 0$. The total number of positive entries in the matrix thus satisfies
$$\sum_{i=1}^{m} r_i = \sum_{i=1}^{m} \frac{n + k + b_i}{2} = \frac{m}{2}(n + k) + \frac{1}{2} \sum_{i=1}^{m} b_i \le \frac{m}{2}(n + k).$$

Achieving the bound requires $\sum_{i=1}^{m} b_i = 0$, which occurs only when half the rows have
$b_i = +1$ and the other half have $b_i = -1$. Thus it is necessary that m be even. One matrix
that achieves the bound when m = 2(n!) is formed by taking all n! permutations of a row
with $\frac{1}{2}(n + k + 1)$ positive entries and all n! permutations of a row with $\frac{1}{2}(n + k - 1)$ positive
entries. By symmetry, all of the dot products are equal, and their sum is zero; hence each
dot product must be zero.
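
The permutation construction can be verified directly for k = 1 and k = 2 (for larger k the 2(n!) rows become impractical to enumerate). The Python sketch below is an illustration under that restriction; the function names are ours. Note that all n! permutations are taken with repetition, as in the solution, since discarding duplicate rows would destroy the balance between the two kinds of rows.

```python
from itertools import combinations, permutations


def permutation_sign_matrix(k):
    """Rows of the 2(n!)-row matrix from the solution, with n = 1 + k*k columns."""
    n = 1 + k * k
    rows = []
    for positives in ((n + k + 1) // 2, (n + k - 1) // 2):
        base = [1] * positives + [-1] * (n - positives)
        # permutations() yields all n! orderings, including repeated rows.
        rows.extend(permutations(base))
    return rows


def column_dot_products(rows):
    """Dot products of all pairs of distinct columns."""
    n = len(rows[0])
    return [sum(r[i] * r[j] for r in rows) for i, j in combinations(range(n), 2)]


def positive_entries(rows):
    """Total number of +1 entries in the matrix."""
    return sum(v == 1 for r in rows for v in r)


if __name__ == "__main__":
    for k in (1, 2):
        rows = permutation_sign_matrix(k)
        m, n = len(rows), 1 + k * k
        print(k, m, set(column_dot_products(rows)), positive_entries(rows), m * (n + k) // 2)
```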


Editorial comment. John H. Lindsey observed that equality in the bound requires m to be 
divisible by 4. The proposer asked for the smallest number of rows allowing equality to be 
achieved for a given n. He and Richard Stong independently provided a construction with 


i= 2s hep): 


Solved also by R. J. Chapman (U. K.), J. H. Lindsey II, K. McInturff, R. Stong, and the proposer.


n-Tuples Whose Elements Divide Their Sum


10597 [1997, 457]. Proposed by David Cox, Amherst College, Amherst, MA. Fix an integer
$n \ge 2$, and let $d_1, d_2, \ldots, d_n$ be positive integers with no common divisor greater than 1.
Suppose that $d_i$ divides $d_1 + \cdots + d_n$ for $1 \le i \le n$.

(a) Prove that $d_1 d_2 \cdots d_n$ divides $(d_1 + \cdots + d_n)^{n-2}$.

(b) For each $n \ge 3$, give an example to show that the exponent in part (a) cannot be made
smaller.


Solution by GCHQ Problems Group, Cheltenham, U. K.

(a) Let p be a prime factor of the product $d_1 d_2 \cdots d_n$, and let $p^a$ be the highest power of
p dividing any one of the $d_i$. We have $p^a \mid \sum d_i$, and thus $p^{a(n-2)} \mid (\sum d_i)^{n-2}$. Since
$d_1, \ldots, d_n$ have no common factor greater than 1, some element $d_i$ is not divisible by p.
Furthermore, since $p \mid \sum d_i$, at least two summands are not divisible by p. Hence the
highest power of p dividing $\prod d_i$ does not exceed $p^{a(n-2)}$. Repeating this for each prime
factor shows that $\prod d_i$ divides $(\sum d_i)^{n-2}$.

(b) Let $d_1 = 1$, $d_2 = n - 1$, and $d_i = n$ for $3 \le i \le n$. Here $\sum d_i = n(n - 1)$,
which is divisible by each $d_i$. Since $d_1 = 1$, the greatest common divisor is 1. We have
$\prod d_i = n^{n-2}(n - 1)$. Since n and n − 1 are coprime, the smallest power of n(n − 1) divisible
by $n^{n-2}(n - 1)$ is $(n(n - 1))^{n-2}$, and thus the exponent cannot be reduced.
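
The example in part (b) can be confirmed by direct computation. The Python sketch below is illustrative only; the function names are ours. It builds the tuple (1, n − 1, n, ..., n), checks the hypotheses of the problem, and finds the least exponent e for which the product of the entries divides the e-th power of their sum.

```python
from functools import reduce
from math import gcd, prod


def example_tuple(n):
    """The tuple d_1 = 1, d_2 = n - 1, d_i = n for 3 <= i <= n (n >= 3)."""
    return [1, n - 1] + [n] * (n - 2)


def satisfies_hypotheses(ds):
    """Entries have no common divisor > 1 and each entry divides the sum."""
    total = sum(ds)
    return reduce(gcd, ds) == 1 and all(total % d == 0 for d in ds)


def least_exponent(ds):
    """Smallest e >= 0 with prod(ds) dividing sum(ds)**e, or None if none is found up to len(ds)."""
    total, product = sum(ds), prod(ds)
    for e in range(len(ds) + 1):
        if total ** e % product == 0:
            return e
    return None


if __name__ == "__main__":
    for n in range(3, 10):
        ds = example_tuple(n)
        print(n, ds, satisfies_hypotheses(ds), least_exponent(ds))
```

By part (a) the search never needs to go beyond e = n − 2 when the hypotheses hold; the cap at len(ds) is only a safeguard for inputs that violate them.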


Editorial comment. Other examples submitted for part (b) by various solvers included
$$d_1 = 1,\ d_2 = 2,\ \text{and } d_i = 3 \cdot 2^{i-3} \text{ for } 3 \le i \le n$$
and
$$d_1 = 1,\ d_i = 2 \text{ for } 2 \le i \le n - 1,\ \text{and } d_n = 2n - 3.$$

Using Euclid’s sequence 2, 3, 7, 43, 1807, ..., the San Jose State Problem Solving Ring gave
an example in which $d_1 d_2 \cdots d_n = (d_1 + \cdots + d_n)^{n-2}$. Another use of Euclid’s sequence
appears in this MONTHLY in the solution of Problem 10532 [1996, 510; 1998, 775], where
references are given.

M. J. Knight and the San Jose State Problem Solving Ring each showed that for given
n the set $D_n$ of n-tuples $(d_1, d_2, \ldots, d_n)$ satisfying the conditions of the problem is finite.
For example, $D_2$ contains only the pair (1, 1), and $D_3$ contains only the triples (1, 1, 1),
(1, 1, 2), (1, 2, 3), and their permutations. The finiteness of $D_n$ is equivalent to the finiteness
of the set $X_n$ of solutions of $1/x_1 + 1/x_2 + \cdots + 1/x_n = 1$ in positive integers, which was
apparently first established by D. R. Curtiss, this MONTHLY 29 (1922) 380-387. A direct
bijection between $D_n$ and $X_n$ is obtained by setting $x_i = (\sum_j d_j)/d_i$.


Solved also by R. Barbara (Lebanon), D. Beckwith, M. Boase (U.K.), J. Brawner, D. Callan, R. J. Chapman (U. K.), T. Hermann, 
R. Holzsager, T. Jager, S. A. Jassim (U. K.), M. J. Knight, C. Lanski, J. H. Lindsey II, D. Lorenzini, K. McInturff, R. Padma (India),
K. Schilling, R. Stong, A. Tissier (France), SJSU Problem Solving Ring, and the proposer. 


Binomial Ratios 


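10625 [1997, 871]. Proposed by Olaf Krafft and Martin Schaefer, Technical University
Aachen, Aachen, Germany. For x > 0 and $n \in \mathbb{N}$, define
$$a_n = \frac{\sum_{i \ge 0} \binom{2^n}{2i} x^i}{\sum_{i \ge 0} \binom{2^n}{2i+1} x^i}.$$
Evaluate $\lim_{n \to \infty} a_n$.


Solution I by Nora Thornber, Raritan Valley Community College, Somerville, NJ. Applying
the binomial theorem four times, we have
$$a_n = \sqrt{x}\, \frac{(1 + \sqrt{x})^{2^n} + (1 - \sqrt{x})^{2^n}}{(1 + \sqrt{x})^{2^n} - (1 - \sqrt{x})^{2^n}} = \sqrt{x}\, \frac{1 + \left( \frac{1 - \sqrt{x}}{1 + \sqrt{x}} \right)^{2^n}}{1 - \left( \frac{1 - \sqrt{x}}{1 + \sqrt{x}} \right)^{2^n}}.$$
But $|(1 - \sqrt{x})/(1 + \sqrt{x})| < 1$, so we conclude that $\lim_{n \to \infty} a_n = \sqrt{x}$.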


Solution II by The National Security Agency Problems Group, Fort Meade, MD. Let
$p = \sqrt{x}/(\sqrt{x}+1)$ and $q = 1/(\sqrt{x}+1)$, so that $0 < p, q < 1$, $p + q = 1$, and $\sqrt{x} = p/q$.
Now consider an experiment consisting of $2^n$ independent tosses of a coin that is biased to
come up heads with probability $p$. Let $E_n$ (respectively, $O_n$) be the probability that an even
(respectively, odd) number of heads comes up. Set $u_n = u_n(p) = E_n/O_n$. Then

$$u_n = \frac{\sum_{i=0}^{2^{n-1}} \binom{2^n}{2i} p^{2i} q^{2^n-2i}}{\sum_{i=0}^{2^{n-1}-1} \binom{2^n}{2i+1} p^{2i+1} q^{2^n-(2i+1)}} = \frac{\sum_{i=0}^{2^{n-1}} \binom{2^n}{2i} (p/q)^{2i}}{\sum_{i=0}^{2^{n-1}-1} \binom{2^n}{2i+1} (p/q)^{2i+1}} = \frac{a_n}{\sqrt{x}}.$$

Hence $a_n = \sqrt{x}\,u_n$.

The independence of the various tosses implies $E_{n+1} = E_nE_n + O_nO_n$ and
$O_{n+1} = 2E_nO_n$. Therefore

$$u_{n+1} = \frac{E_n^2 + O_n^2}{2E_nO_n} = \frac{1}{2}\left(u_n + \frac{1}{u_n}\right).$$

By the arithmetic-geometric mean inequality, $u_n \ge 1$; hence $u_n \ge (1/2)(u_n + 1/u_n) =
u_{n+1}$. Therefore the sequence $u_n$ is decreasing and bounded below; it follows that
$L = \lim_{n\to\infty} u_n$ exists, and satisfies $L = (1/2)(L + 1/L)$. Therefore $L = 1$, so we conclude
that $\lim_{n\to\infty} a_n = \sqrt{x}$.
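Both the limit and the recurrence used in Solution II can be checked with exact rational arithmetic. The sketch below assumes the definition $a_n = \sum_i \binom{2^n}{2i} x^i \big/ \sum_i \binom{2^n}{2i+1} x^i$ and picks $x = 9/4$, an arbitrary choice that makes $\sqrt{x} = 3/2$ rational. The function name is illustrative.

```python
from fractions import Fraction
from math import comb


def a(n, x):
    """a_n = sum_i C(2^n, 2i) x^i / sum_i C(2^n, 2i+1) x^i, computed exactly."""
    size = 2 ** n
    numerator = sum(comb(size, 2 * i) * x ** i for i in range(size // 2 + 1))
    denominator = sum(comb(size, 2 * i + 1) * x ** i for i in range(size // 2))
    return numerator / denominator


x = Fraction(9, 4)          # chosen so that sqrt(x) = 3/2 exactly
sqrt_x = Fraction(3, 2)

for n in range(1, 6):
    u_n = a(n, x) / sqrt_x
    u_next = a(n + 1, x) / sqrt_x
    assert u_next == (u_n + 1 / u_n) / 2    # the recurrence from Solution II
    assert u_n >= u_next >= 1               # decreasing and bounded below by 1
    print(n, float(a(n, x)))
```

The printed values approach 1.5 very quickly, since the error behaves like a power of $(1-\sqrt{x})/(1+\sqrt{x})$ with exponent $2^n$.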


Solution III by Ulrich Abel, Fachhochschule Giessen-Friedberg, Friedberg, Germany. We
prove the following generalization: For integers $k \ge 1$, $r, s \ge 0$, and real $x > 0$, we have

$$b_n = \frac{\sum_{i \ge 0} \binom{kn}{ki+r} x^i}{\sum_{i \ge 0} \binom{kn}{ki+s} x^i} \to x^{(s-r)/k} \quad \text{as } n \to \infty.$$

In the special case $k = 2$, $r = 0$, $s = 1$, we have $b_{2^{n-1}} = a_n$, and conclude that $a_n \to \sqrt{x}$.

Let $z$ be a primitive $k$th root of unity. Then the finite geometric sum $\sum_{j=0}^{k-1} z^{ij}$ is $k$ if $i$ is
a multiple of $k$ and 0 otherwise. Choose $y > 0$ with $y^k = x$. We obtain

$$\sum_{i \ge 0} \binom{kn}{ki+r} x^i = \frac{1}{k} \sum_{j=0}^{k-1} \sum_{i \ge r} \binom{kn}{i} z^{(i-r)j} y^{i-r} \sim \frac{1}{k y^r} \sum_{j=0}^{k-1} z^{-rj} (1 + z^j y)^{kn} \sim \frac{(1+y)^{kn}}{k y^r}$$

as $n \to \infty$, and this identity also holds with $s$ in place of $r$. Therefore $b_n \to y^{s-r} = x^{(s-r)/k}$
as $n \to \infty$.
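The generalization in Solution III lends itself to a quick numerical check. The sketch below evaluates $b_n$ exactly for a few parameter choices and compares it with $x^{(s-r)/k}$. The function name and the particular values of $k$, $r$, $s$, $x$ are illustrative assumptions, not taken from the solution.

```python
from fractions import Fraction
from math import comb


def b(n, k, r, s, x):
    """Ratio of sum_i C(kn, ki+r) x^i to sum_i C(kn, ki+s) x^i, as an exact Fraction."""
    x = Fraction(x)

    def filtered_sum(offset):
        # Sum over all i >= 0 with k*i + offset <= k*n; later terms vanish.
        total = Fraction(0)
        i = 0
        while k * i + offset <= k * n:
            total += comb(k * n, k * i + offset) * x ** i
            i += 1
        return total

    return filtered_sum(r) / filtered_sum(s)


# k = 3, r = 1, s = 2, x = 8: the predicted limit is 8 ** (1/3) = 2.
print(float(b(20, 3, 1, 2, 8)))
# k = 2, r = 0, s = 1, x = 9/4: this is the original problem, with limit 3/2.
print(float(b(16, 2, 0, 1, Fraction(9, 4))))
```

With $k = 2$, $r = 0$, $s = 1$, the value $b_{2^{n-1}}$ is the ratio $a_n$ of the original problem.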


Editorial comment. Jean Anglesio noted that when x is a complex number (but not a negative
real) the limit is the principal value of the square root of x. When x < 0 the limit does not 
exist. 

Solved also by S. A. Ali, K. F. Andersen (Canada), J. Anglesio (France), D. Beckwith, C. Berg (Sweden), J. C. Binz (Switzerland),
P. Bracken (Canada), D. Callan, R. J. Chapman (U. K.), J. E. Dawson (Australia), M. N. Deshpande (India), Z. Franco, C. Georghiou 
(Greece), T. Hermann, V. Hernandez (Spain), J.-H. Kim, R. A. Kopas, O. Kuba (Syria), N. F. Lindquist, J. H. Lindsey II, N. Lord 
(U. K.), S. Mahajan, D. A. Morales (Venezuela), M. Omarjee (France), M. M. Patnaik, G. Peng, H. Qin, H. Salle (The Netherlands), 
V. Schindler (Germany), R. Shahidi (Canada), N. C. Singer, A. Sofo (Australia), A. Stenger, D. B. Tyler, M. Vowe (Switzerland), 
M. Woltermann, Anchorage Math Solutions Group, GCHQ Problems Group, WMC Problems Group, and the proposer. 


A Triangle Inequality 
10644 [1998, 175]. Proposed by Mihaly Bencze, Brasov, Romania. Given an acute triangle
with sides of length a, b, and c, inradius r, and circumradius R, prove that

$$\frac{r}{2R} \le \frac{abc}{\sqrt{2(a^2+b^2)(b^2+c^2)(c^2+a^2)}}.$$

Solution by the GCHQ Problems Group, Cheltenham, England. We have

$$a^2 - (b^2+c^2)(1-\cos A) = b^2 + c^2 - 2bc\cos A - (b^2+c^2) + (b^2+c^2)\cos A = (b-c)^2 \cos A \ge 0,$$

since A is acute. Hence $a^2 \ge (b^2+c^2)(1-\cos A) = 2(b^2+c^2)\sin^2(A/2)$. It follows that
$a^2b^2c^2 \ge 8(a^2+b^2)(b^2+c^2)(c^2+a^2)\sin^2(A/2)\sin^2(B/2)\sin^2(C/2)$, and so

$$\frac{abc}{\sqrt{2(a^2+b^2)(b^2+c^2)(c^2+a^2)}} \ge 2\sin\frac{A}{2}\sin\frac{B}{2}\sin\frac{C}{2}.$$

The standard fact $r = 4R\sin(A/2)\sin(B/2)\sin(C/2)$ now yields the required result.

Editorial comment. Several solvers noted that equality holds when the triangle is equilateral
and that the result is valid also when the triangle is not acute.
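The inequality is easy to test numerically, including on triangles that are not acute, as the editorial comment suggests. The following sketch computes both sides from the side lengths, using Heron's formula for the area together with $r = \text{area}/s$ and $R = abc/(4 \cdot \text{area})$. The function name is illustrative.

```python
import math


def both_sides(a, b, c):
    """Return (r / (2R), abc / sqrt(2 (a^2+b^2)(b^2+c^2)(c^2+a^2))) for a triangle."""
    s = (a + b + c) / 2                                   # semiperimeter
    area = math.sqrt(s * (s - a) * (s - b) * (s - c))     # Heron's formula
    inradius = area / s
    circumradius = a * b * c / (4 * area)
    lhs = inradius / (2 * circumradius)
    rhs = a * b * c / math.sqrt(
        2 * (a * a + b * b) * (b * b + c * c) * (c * c + a * a)
    )
    return lhs, rhs


for sides in [(1, 1, 1), (3, 4, 5), (2, 3, 4), (5, 5, 6)]:
    lhs, rhs = both_sides(*sides)
    print(sides, round(lhs, 4), round(rhs, 4), lhs <= rhs + 1e-12)
```

The equilateral triangle gives 1/4 on both sides. The triangle with sides 2, 3, 4 is obtuse and still satisfies the inequality.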


Solved also by J. Anglesio (France), E. Braune (Austria), Z. Cerin (Croatia), J. Melville (Scotland), C. A. Minh, P. E. Niiesch 
(Switzerland), G. Peng, C. Popescu (Belgium), C. R. Pranesachar (India), S. M. Soltuz (Romania), M. Vowe (Switzerland), 
R. L. Young, SAS Maths Club (india), and the proposer. 


Limit of a Recurrence 


10648 [1998, 271]. Proposed by N. P. Bhatia, University of Maryland, Baltimore County,
MD, and W. O. Egerland, Bel Air, MD. Let $z_1, z_2, \ldots, z_m$ be $m \ge 2$ points in the complex
plane, and let $p_1, p_2, \ldots, p_m$ be positive real numbers such that $p_1 + p_2 + \cdots + p_m = 1$.
For $\omega$ real and $n > m$, let $z_n = (p_1 z_{n-1} + p_2 z_{n-2} + \cdots + p_m z_{n-m})e^{i\omega}$. Show that the
sequence $z_1, z_2, \ldots$ converges, and determine its limit.




Solution by the editors. Let $f_k(s) = \sum_{n=1}^{k} z_n s^n$ and $f(s) = \sum_{n=1}^{\infty} z_n s^n$. Since each $z_{n+1}$
with $n \ge m$ is $e^{i\omega}$ times a convex combination of the $m$ preceding terms, the sequence is
bounded and indeed $|z_n| \le \max\{|z_1|, |z_2|, \ldots, |z_m|\}$, so the radius of convergence for $f(s)$
is positive. From the recurrence relation we have

$$f(s) = f_m(s) + e^{i\omega} \sum_{k=1}^{m} p_k s^k \bigl(f(s) - f_{m-k}(s)\bigr).$$

Thus $f(s)$ is a rational function of $s$, say $f(s) = A(s)/B(s)$.

Assume first that $e^{i\omega} = 1$. Now $B(s) = 1 - \sum_{k=1}^{m} p_k s^k$ has a zero at $s = 1$ since
$\sum_{k=1}^{m} p_k = 1$. It is a simple zero since $B'(1) = -\sum_{k=1}^{m} k p_k$ is not zero. But $B(s)$ has no
other zeros on or inside the unit disk, since if $|s| \le 1$, then $|\sum_{k=1}^{m} p_k s^k| \le \sum_{k=1}^{m} p_k |s^k| \le
\sum_{k=1}^{m} p_k = 1$, with equality only if $|s| = 1$ and all $s^k$ have the same argument. Thus
we have a partial fraction expansion $f(s) = A(1)/(B'(1)(s-1)) + C(s)$ where $C(s)$ is
a rational function of $s$ with all poles outside the unit disk. The Maclaurin series for $C(s)$
has radius of convergence greater than 1, so its coefficients go to zero, while the Maclaurin
series for $A(1)/(B'(1)(s-1))$ has all coefficients equal to $-A(1)/B'(1)$. It follows that

$$\lim_{n\to\infty} z_n = -\frac{A(1)}{B'(1)} = \frac{\sum_{k=1}^{m} p_k \sum_{n=m-k+1}^{m} z_n}{\sum_{k=1}^{m} k p_k}.$$

If $e^{i\omega} \ne 1$, then $B(s)$ has all its roots outside the closed unit disk, so we see that $z_n \to 0$
without a partial fraction argument.
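The limit formula can be compared with a direct iteration of the recurrence. This is a minimal sketch, with illustrative function names and arbitrarily chosen starting points and weights. It checks the case $\omega = 0$ against the closed form and one case with $e^{i\omega} \ne 1$ against the limit 0.

```python
import cmath


def iterate(z_init, p, omega, steps):
    """Apply z_n = e^{i omega} (p_1 z_{n-1} + ... + p_m z_{n-m}) for `steps` further terms."""
    z = list(z_init)
    m = len(p)
    rotation = cmath.exp(1j * omega)
    for _ in range(steps):
        z.append(rotation * sum(p[k] * z[-1 - k] for k in range(m)))
    return z[-1]


def predicted_limit(z_init, p):
    """Closed form for omega = 0: sum_k p_k (z_{m-k+1} + ... + z_m) / sum_k k p_k."""
    m = len(p)
    numerator = sum(p[k - 1] * sum(z_init[m - k:]) for k in range(1, m + 1))
    denominator = sum(k * p[k - 1] for k in range(1, m + 1))
    return numerator / denominator


z0 = [1, 1j, -1]
weights = [0.2, 0.3, 0.5]
print(iterate(z0, weights, 0.0, 200), predicted_limit(z0, weights))
print(abs(iterate(z0, weights, 1.0, 2000)))   # e^{i omega} != 1: tends to 0
```

For $m = 2$, $p_1 = p_2 = 1/2$, $z_1 = 0$, $z_2 = 1$, the formula gives $2/3$, the familiar limit of repeated averaging of the last two terms.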


Solved also by D. M. Bradley, B. Burdick, D. Callan, R. J. Chapman (U. K.), G. Keselman, J. H. Lindsey II, V. Lucic (Canada), 
W. A. Newcomb, P. Szeptycki, E. I. Verriest, and the proposers.


Random Polynomials with Real Roots 


10660 [1998, 366]. Proposed by Colin L. Mallows, AT&T Laboratories, Florham Park, NJ. 
Suppose the coefficients of a polynomial are independent Gaussian random variables, each 
with mean 0. For each € > 0, can the variances be chosen so that all of the zeroes of the 
polynomial are real with probability at least 1 — €? 


Solution by Kenneth Schilling, University of Michigan—Flint, Flint, MI. We prove the fol- 
lowing slightly stronger claim by induction on n. 


Proposition. Fix $\epsilon > 0$ and $n \ge 1$. There exist $\sigma_0, \ldots, \sigma_n > 0$ such that, if (a) $a_0, \ldots, a_n$
are independent Gaussian random variables with mean 0, (b) each $a_i$ has variance $\sigma_i^2$, and
(c) $S$ is the event $\{a_0 + a_1 x + \cdots + a_n x^n = 0$ has $n$ distinct nonzero real solutions $x\}$, then
$S$ has probability at least $1 - \epsilon$.

Proof. This is obvious for $n = 1$. To complete the induction, let $n$ and $\epsilon > 0$ be given.
Let $\sigma_0^2, \ldots, \sigma_n^2$ be variances as provided by the induction hypothesis applied to $\epsilon/2$ and $n$.
Let $f(x)$ denote the random polynomial $a_0 + a_1 x + \cdots + a_n x^n$, and let $g(x) = x f(x)$.
Then on the event $S$, the function $g$ has $n + 1$ distinct real zeros, the derivative $g'$ has $n$ real
zeros $y_1 < \cdots < y_n$, and the numbers $g(y_i)$ are nonzero and alternate in sign. Hence if
$|\delta| < \min\{|g(y_1)|, \ldots, |g(y_n)|\}$, then $h(x) = g(x) + \delta$ has $n + 1$ distinct real zeros.

Define the random variable $M = \min\{|g(x)| : x \in \mathbb{R},\ g'(x) = 0\}$. Since $M > 0$ on $S$,
there exists $\delta > 0$ such that the probability of the event $S \cap \{M > \delta\}$ is at least $1 - \epsilon/2$. Now
let $b$ be a Gaussian random variable, independent of $a_0, \ldots, a_n$, with mean 0 and variance
$\sigma^2$, where $\sigma^2$ is chosen so that $|b| < \delta$ with probability at least $1 - \epsilon/2$. Then the event
$S \cap \{M > \delta\} \cap \{0 < |b| < \delta\}$ has probability at least $(1 - \epsilon/2)^2 > 1 - \epsilon$, and on this
event the equation $h(x) = b + x f(x) = b + a_0 x + a_1 x^2 + \cdots + a_n x^{n+1} = 0$ has $n + 1$
distinct nonzero real solutions.
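The proof is existential, but the idea that widely separated variances force real roots can be explored by simulation. The sketch below is an illustration under our own assumptions, not the construction in the proof. It gives the coefficient of $x^j$ the standard deviation $K^{-(n-j)^2/2}$ for a large constant $K$ (a hypothetical choice that makes each coefficient dominate the product of its neighbours with high probability) and counts distinct real roots exactly with a Sturm sequence over the rationals.

```python
import random
from fractions import Fraction


def poly_rem(a, b):
    """Remainder of a modulo b; polynomials are coefficient lists, highest degree first."""
    a = list(a)
    while len(a) >= len(b):
        factor = a[0] / b[0]
        for i in range(len(b)):
            a[i] -= factor * b[i]
        a.pop(0)                      # the leading coefficient is now exactly zero
    while a and a[0] == 0:            # drop any further leading zeros
        a.pop(0)
    return a


def count_real_roots(coeffs):
    """Number of distinct real roots (Sturm's theorem) of a polynomial of degree >= 1."""
    p = [Fraction(c) for c in coeffs]
    while p[0] == 0:
        p.pop(0)
    degree = len(p) - 1
    chain = [p, [c * (degree - i) for i, c in enumerate(p[:-1])]]   # p and its derivative
    while True:
        r = poly_rem(chain[-2], chain[-1])
        if not r:
            break
        chain.append([-c for c in r])

    def sign_changes(values):
        signs = [v > 0 for v in values if v != 0]
        return sum(1 for u, v in zip(signs, signs[1:]) if u != v)

    at_plus_inf = [q[0] for q in chain]
    at_minus_inf = [q[0] * (-1) ** (len(q) - 1) for q in chain]
    return sign_changes(at_minus_inf) - sign_changes(at_plus_inf)


def sample_coefficients(n, K, rng):
    """Coefficients of x^n, ..., x^0 with standard deviations K ** (-(n - j)**2 / 2)."""
    return [rng.gauss(0.0, K ** (-((n - j) ** 2) / 2)) for j in range(n, -1, -1)]


def fraction_all_real(n, K, trials, seed=0):
    """Estimated probability that a sampled degree-n polynomial has n distinct real roots."""
    rng = random.Random(seed)
    hits = sum(count_real_roots(sample_coefficients(n, K, rng)) == n for _ in range(trials))
    return hits / trials


print(fraction_all_real(4, 1.0, 200))    # equal variances
print(fraction_all_real(4, 1e8, 200))    # widely separated variances
```

With equal variances only a small fraction of quartics have four real roots, while for large $K$ the estimated fraction should be close to 1. The exact figures depend on the random seed.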


Solved also by J. H. Lindsey II, GCHQ Problems Group (U. K.), and the proposer. 




REVIEWS


Edited by Harold P. Boas 
Mathematics Department, Texas A & M University, College Station, TX 77843-3368 


Life’s Other Secret. By Ian Stewart. Wiley, New York, 1997, xiii + 285 pp., $24.95 
hardcover. 

The Magical Maze. By Ian Stewart. Wiley, New York, 1998, xii + 268 pp., $24.95 
hardcover. 


Reviewed by Dan Schnabel 


In the city where I reside, smaller bookstores are disappearing due to market 
pressure from book superstores. The selection of mathematics titles available in 
these new superstores is astonishing both in its magnitude and in its eccentricity. 
Books on tricks to improve basic arithmetic skills stand next to highly specialized 
research texts. Scattered among these are the mathematics popularization books. 
It is not obvious from the increased shelf space whether mathematics is actually 
becoming more popular, but these two recent books by Ian Stewart certainly 
further that goal. 

Any good high school mathematics teacher recognizes that to learn mathemat- 
ics, one must appreciate it; and real-world relevance helps students to appreciate 
mathematics. Beyond high school, the people who toil or play at mathematics are 
most likely to be those whose appreciation of the subject is well established. 
Relevance becomes a lesser concern if it remains a concern at all. Good efforts to 
popularize mathematics must return to the matter of relevance, and the best works 
do so in a manner that intrigues even those for whom relevance no longer matters. 
Ian Stewart’s books are among the best; they change the way we look at the world.

Or even the way we look at ourselves. 

Standing in front of a mirror waving your left hand, you see an image of yourself 
waving its right hand. This leads to the question: “Why does a mirror reverse left 
and right, but not top and bottom?” The Magical Maze provides an explanation. 
For a bilaterally symmetrical object such as the human body, the image created by 
reflection in a mirror can also be achieved by a 180° rotation in space. Our visual 
processing system is conditioned to assume that the image results from a rotation, 
because in the real world it is possible for us to rotate objects manually, but it is 
not possible for us to reflect them. A left shoe will always be a left shoe, no matter 
how we manipulate it before our eyes. Seeing symmetry broken by the waving of 
one hand, we still assume that the image is the result of a rotation—the rotation of 
a person waving the other hand. 

Stewart points out that top and bottom are not switched if you turn the mirror 
on its side, but he does not consider the more intriguing case in which you lie on 
your side. Lie on your right side in front of a mirror so that your left leg is on top 
and your right leg is on the bottom. The image in the mirror has the left leg on the 
bottom and the right leg on top. Ignoring anything else that might appear in the 
mirror, we can no longer distinguish whether the mirror has switched left and right 
or switched top and bottom. 




Stewart’s ideas and presentation frequently inspired me, even required me, to 
think beyond his writing, as when I considered the following thought experiment. 
Imagine an intelligent creature having both left-right symmetry and top-bottom 
symmetry. (How many eyes would such a creature have?) Would this creature 
think that mirrors switch left and right, or top and bottom? I suspect that, even 
though its image can be achieved through two different rotations (as well as 
through reflection), the creature would still see the mirror as swapping left and 
right, because of the uniqueness of the vertical orientation it learns from the pull 
of gravity. What, then, is the role of gravitation in the way our own visual 
conditioning interprets mirror images? I found myself wondering what Stewart 
would have to say about this.

Symmetry is a recurring motif in both these books, and the books themselves 
are somewhat symmetrical. The Magical Maze is a mathematics book in which one 
encounters some biology, while Life’s Other Secret, subtitled The New Mathematics 
of the Living World, is essentially a biology text in which one encounters some 
mathematics. 

How much mathematics Life’s Other Secret can be said to contain depends on 
what one regards as mathematics. Stewart would like readers to acknowledge a 
broad meaning for mathematics and to recognize a large role for mathematics in 
the biological sciences. 

The title Life’s Other Secret refers to the idea that genetics and DNA do not 
provide the complete picture for life on earth. Stewart suggests a different role 
for genes: 


The cell carries out its genetic instructions; the laws of physics and chemistry produce certain 
results, and when you put the two together, you get an organism. 


Consequently, an understanding of the laws of physics and chemistry and of the 
underlying mathematics is equal in importance to genetics in the understanding 
of life. 

Given the symmetrical relationship between the two books, it is no surprise that 
Stewart’s best examples of the intersection of biology and mathematics are ad- 
dressed in both books. 

Each of the books contains a discussion of why the number of petals on most 
flowers is a Fibonacci number. The question is quickly reduced to the arrangement 
of tissue called primordia at the tip of a plant shoot. As a plant grows, these 
primordia appear in places that are determined by the need to be closely packed 
around a circle. The best packings occur when the primordia are separated by an 
irrational multiple of 360°, because rational multiples of 360° would generate 
“spokes” of the primordia. The theory of continued fractions can be used to show 
that the golden ratio φ = (1 + √5)/2 is the “most irrational” number. Measurements
confirm that primordia are usually positioned around a “generative spiral”
separated by an angle that, measured externally, is approximately 360/φ degrees;
measured internally this is approximately 137.5°.
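The arithmetic behind these angles, and the “spokes” produced by rational multiples of 360°, can be made concrete. The following is a small illustrative sketch; the function name and the sample fraction 3/8 are our own choices, not taken from either book.

```python
import math
from fractions import Fraction

phi = (1 + math.sqrt(5)) / 2
external = 360 / phi            # angle measured externally, in degrees
internal = 360 - external       # the same separation measured internally
print(round(external, 2), round(internal, 2))   # 222.49 137.51


def directions(turn, count):
    """Distinct directions (as fractions of a full turn) used by `count` successive primordia."""
    return {(k * turn) % 1 for k in range(count)}


# A rational turn of 3/8 puts 100 primordia on only 8 spokes.
print(len(directions(Fraction(3, 8), 100)))
# A turn of 1/phi never repeats a direction.
print(len(directions(1 / phi, 100)))
```

A turn of p/q of a full circle can only ever occupy q directions, which is the “spokes” effect; the golden-ratio turn gives 100 distinct directions for 100 primordia.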

The most intriguing aspect of this discussion is that nature is consistent with 
number theorists on the matter of determining “how irrational” an irrational 
number is, at least in the case of the golden ratio. It is also interesting that nature 
appears unwilling to settle for anything other than the most irrational number. 
Reading Stewart left me wondering what sort of packings would result from 
irrational numbers that are not the most irrational. 

Stewart focuses on the sunflower plant, the large head of which clearly demonstrates
the packing problem. He suggests that the position of primordia provides a
nearly complete explanation of the pattern of spirals on the head of a sunflower
plant. Missing are the mathematical details explaining why the number of clockwise
spirals and the number of counterclockwise spirals that we perceive are two
consecutive Fibonacci numbers. We read only that it is because of the close 
relationship between Fibonacci numbers and the golden ratio, but I would like to 
have been shown more of the mathematics explaining how the number of spirals 
we see results from the separation on a “generative spiral”. Moreover, Stewart 
never clarifies the connection between the actual number of petals and the 
position of primordia—the connection between “how many to arrange” and “how 
to arrange them”.

Both of these books will frequently frustrate the more mathematically minded 
reader, as they often stop short of the interesting, nitty-gritty mathematical details. 
This is not necessarily a bad thing: the books are quite effective at whetting one’s 
appetite for more mathematics. But they are not always good at indicating where 
to turn for more details. The Magical Maze, which is the more mathematical of the 
two books, does provide more details in the endnotes, but they are referred to in 
the text in a manner that, while intended to be consistent with the maze metaphor 
of the book, is awkward. The endnotes are numbered, but the references to them 
are not, so unless you always read all the endnotes, it is unclear which one is 
being cited.

On the matter of mathematical details, Life’s Other Secret is the weaker, more 
frustrating, of the books, as it is essentially a book of biology. It talks about 
mathematics without actually including a great deal of traditional mathematics. 
When Stewart simplifies explanations with expressions such as “The mathematical 
machinery reveals...,” I cannot help wanting to see the machinery in action, not 
just the results. While there are complicated details of how a tobacco virus 
develops, nowhere is there mathematics of a comparable level of difficulty. 

How much mathematics should be included in a popular mathematics book? 
This is one of the fundamental questions facing writers in the genre. Certainly The 
Magical Maze is a book in which mathematicians will feel at home. It contains a 
great deal of mathematics, and it attempts explanations that are unusually complex 
for a book of its type. Its level of sophistication led me to speculate on the 
possibility that it succeeded in being published primarily because it was written to 
accompany the televised 1997 Christmas Lectures of the Royal Institution of Great 
Britain. Not that I mean by this to disparage the book itself; rather, I wonder 
about the system that makes books of this calibre a rare occurrence. 

No doubt readers with only basic high school mathematics training will find The 
Magical Maze difficult going, made more so by an uncharacteristically high number 
of mistakes in the explanations and diagrams. While some mistakes are typographi- 
cal and others appear to be printing errors, there are genuine calculation errors as 
well. Such challenges to comprehensibility may prevent this book from achieving 
its popularization aims, but do not significantly diminish its quality. 

Meritorious as these two books are, publishers and the general public only seem 
to readily embrace more superficial books, such as Stewart’s earlier work Nature’s 
Numbers. The dust jacket for The Magical Maze includes Nature magazine’s praise 
of Nature’s Numbers: 


Stewart achieves what other popular mathematics writers merely strive for: an accurate, 
informative portrayal of contemporary mathematics without a single equation in sight. 


As much as I liked Nature’s Numbers, I hope that other popular mathematics
writers are not striving to eliminate equations from their books. The ability to
express abstract ideas in the form of equations is a cornerstone of mathematics.
The near-taboo status of equations in popular books is irksome. Although doing 
mathematics and writing about mathematics are two different things, books that 
avoid equations altogether are not popularizing mathematics as it truly is, but are 
merely making mathematics marketable. 


6000 Yonge Street #510, Toronto, Ontario, Canada M2M 3W1
schnabel@interlog.com 


An Introductory Course in Commutative Algebra. By A. W. Chatters and C. R.
Hajarnavis. Oxford University Press, 1998, viii + 144 pp., $35 softcover,
$75 hardcover.

Introduction to Algebra. By Peter J. Cameron. Oxford University Press, 1998, x + 295
pp., $32 softcover, $65 hardcover.


Reviewed by Cynthia Woodburn 


Driving home last night, I was unhappy with the particular selection playing on my 
favorite classical radio station, so I switched to my second favorite classical radio 
station. Imagine my surprise to hear a voice discussing mathematics, and even 
more surprisingly, discussing the new pop fascination with mathematics. Evidently, 
there is a trend in pop culture towards the notion that “math is cool”. There is
even a cologne for men available now named “Pi”. The two books under review 
may not make it onto any popular best-seller lists, be made into movies, or have 
colognes named after them, but both could be useful in helping students to 
appreciate that “abstract algebra is cool”. 

An Introductory Course in Commutative Algebra is a “lean and lively” introduc- 
tion to commutative algebra with a definite number-theory perspective. Many 
examples are number theoretic in nature, and number theory is used frequently to 
motivate new concepts. For example, the chapter on ruler and compass construc- 
tions includes a discussion of the connection between Fermat primes and the 
constructibility of regular n-gons. Written for use by undergraduates, the book is 
appropriate for a second-semester course in abstract algebra. Its prerequisites 
include knowledge of equivalence relations, some elementary group theory such as 
Lagrange’s Theorem, and some basic linear algebra. With caution being used at 
the places where some elementary group theory is assumed (for instance, Chapter 
10 on finite cyclic groups and finite fields), the book could be used as a text for a 
first-semester abstract algebra course; it would be especially good for a class 
composed of secondary mathematics education majors. 

The book begins with an introductory chapter on rings. Since the focus is on 
commutative algebra, a ring is defined as a commutative ring with identity. For 
those more familiar with a ring not necessarily being commutative or having a 
multiplicative identity, some minor adjustments in thinking must be made. For 
example, if one defines a ring in this fashion, then ideals are not typically subrings, 
and the even integers do not form a subring of the integers. Chapters 2 and 3 cover 
Euclidean rings and the highest common factor. I was disappointed to find that the
Euclidean algorithm is not included (although it is mentioned on p. 39 in Chapter
6). The optional Chapter 4 uses “the ring of Gaussian integers to prove one of the
most famous theorems in number theory: every positive integer is the sum of four 
squares.” Next are chapters on the traditional topics of fields and polynomials, 
unique factorization domains, the field of quotients of an integral domain, factor- 
ization of polynomials, fields and field extensions, finite cyclic groups and finite 
fields, and algebraic numbers. Chapter 12 on ruler and compass constructions 
contains instructions for classical constructions, such as bisecting angles and line 
segments and dropping perpendiculars, along with the algebra of constructible 
numbers. The three impossible constructions from antiquity are discussed, as well 
as the proof by Gauss that the regular 17-gon is constructible (the longest proof of 
the text). The final three chapters cover homomorphisms, ideals and quotient rings 
(some familiarity with quotient groups is assumed), principal ideal domains and a 
method for constructing fields, and finite fields. 

The text is extremely readable with a very concrete approach. It is evident that 
the authors strove to make the text understandable by undergraduate students. 
Comments are included to explain the significance of results. Definitions are 
often repeated when terminology is reused, and difficult concepts are explained 
in everyday language. The book abounds with examples, many of which include 
actual numbers, something students who are struggling with proofs and abstraction 
will appreciate. Most of the exercises are straightforward. Some are very easy, such 
as proving that every field is an integral domain (which appears as Exercise 5.1 
and Exercise 9.2). More challenging exercises have hints or contain sketches 
of a solution with the details to be filled in by students. Answers to selected 
exercises can be found in the back of the book, although there is 
no notation within the exercise sets to indicate which problems have solutions 
provided. 

The second text under review, Introduction to Algebra, is “lively” but not “lean”.
While the text by Chatters and Hajarnavis contains fifteen short chapters (the 
shortest—on constructing the field of quotients of an integral domain—is 2 
pages, and the longest—on ruler and compass constructions—is 15 pages), the text 
by Cameron, twice the length, is organized into eight large chapters with sections 
and subsections. This book contains more information than can be covered in a 
year-long algebra course, which allows for flexibility in its use. After an introduc- 
tory chapter containing preliminary concepts and motivating material about alge- 
bra, the book begins with a study of ring theory guided by the familiar example of 
the integers. Group theory follows. As pointed out by the author in the preface, 
these two chapters could form the basis for an introductory one-semester course in 
abstract algebra. Next are chapters on linear algebra and module theory. Chapter 6 
is a change of pace: it contains a formal construction of the natural numbers, 
integers, rational numbers, real numbers (via Cauchy sequences), and complex 
numbers; also included are algebraic and transcendental numbers. Chapter 7 
contains further topics from group theory, ring theory, and field theory, along with 
other advanced topics not typically found in a book at this level: namely, universal 
algebra, lattices, and category theory. The final chapter, entitled “Applications”,
discusses Galois theory (a classical application), and error-correcting codes (a 
modern application). 

Cameron’s writing style is very enjoyable and reader-friendly. He uses entertain- 
ing verbs such as “whittle” and “blur” and gives many examples throughout the 
book. Modern and up-to-date analogies help students relate to concepts: for 
example, rings with special properties are compared to personal computers with
extra features. Connections between concepts are emphasized. The use of certain
terminology and notation is explained, and differences in notation are pointed out. 
For instance, maps and functions are written on the right in the text, and the 
reader is urged to “remember that not everybody uses this convention!” Solutions 
to selected exercises are provided in the back of the book. There is no notation 
within the exercise sets to denote those problems whose solutions are given, but 
more difficult exercises are marked with an asterisk. Some of the exercises are 
even marked with two asterisks. 

Even though the text is reader-friendly, a high level of rigor is maintained. 
Kernels are first defined as equivalence relations, polynomials are defined as
infinite sequences, and three different proofs of the existence of transcendental 
numbers are given. 

Both texts include some historical background of terminology and results. 
Cameron’s book also includes some interesting mathematical folklore, which piques 
the interest of many students. For example, he relates that “legend has it that the 
[irrationality of the square root of 2] was discovered by Hippasos, a member of the 
Pythagorean Brotherhood; he was expelled from the Brotherhood (or, in some 
versions, drowned in a shipwreck) to prevent him from revealing the shameful 
truth that nature contains irrationality.” Each text also includes references or 
sources for further reading. About half of the references in An Introductory Course 
in Commutative Algebra are from the area of number theory. Introduction to Algebra 
has a more extensive and comprehensive list of sources for further reading, with 
several introductory paragraphs expounding upon the sources. 

Missing from both texts are ideas for cooperative learning activities or 
writing assignments. Cameron’s text does have a very nice web page at 
http://www.maths.qmw.ac.uk/~pjc/algebra/. It contains solutions to
all of the exercises from the first three chapters of the text in both LaTeX and
PostScript formats, further material and problems, links to other sites of interest to 
algebraists, and corrections. 

There are many abstract algebra books on the market. A subject search through 
a popular online book distributor yielded a list of 148 abstract algebra books and a 
list of 52 books classified under commutative algebra. The books by Chatters and 
Hajarnavis and by Cameron are fine additions to the collection of abstract algebra 
books available for use at the undergraduate level, and each in its own way does a 
great job of advancing the notion that “abstract algebra is cool”.


Pittsburg State University, Pittsburg, KS 66762 
cwoodbur@pittstate.edu




TELEGRAPHIC REVIEWS 


Edited by Arnold Ostebee 


with the assistance of the Mathematics Departments of 
Carleton, Macalester, and St. Olaf Colleges 


Telegraphic Reviews are designed to alert readers in a timely manner to new books
appropriate to mathematics teaching and research. Special codes classify reviews by
subject area and appropriate use:

T : Textbook
C : Computer Software
S : Supplementary Reading
P : Professional Reading
L : Undergraduate Library
13–18 : Grade Level
1–4 : Semester
** : Special Emphasis
?? : Questionable

Readers are advised that price information is subject to change. Selected books
receive a second, more extensive review in the Monthly.


Books submitted for review should be sent to Book Reviews Editor, American Mathe- 
matical Monthly, St. Olaf College, 1520 St. Olaf Avenue, Northfield, MN 55057-1098. 


Mathematics Appreciation, T(13: 1, 2). 
Mathematics: A Practical Odyssey, Third Edi- 
tion. David B. Johnson, Thomas A. Mowry. 
Brooks/Cole, 1998, xx + 835 pp, $66.95. [ISBN 
0-534-35075-5] Revisions include new sec- 
tion on right angle trigonometry, appendix on 
dimensional analysis, optional subsections uti- 
lizing graphing calculator capabilities. (Second 
Edition, TR, November 1995.) KES 


Education, P*, L. Confronting the Core Cur- 
riculum: Considering Change in the Under- 
graduate Mathematics Major. Ed: John A. 
Dossey. MAA Note Series No. 45. MAA, 
1998, xii + 136 pp, $38.50 (P). [ISBN 0-88385-155-5]
Proceedings of the 1994 “West Point
Core Curriculum in Mathematics” conference 
and a 1995 follow-up workshop. Papers dis- 
cuss course content issues and student growth 
goals for the first two years of the undergraduate 
mathematics curriculum. AO 


Discrete Mathematics, T(13–14: 1, 2), L. Discrete
Algorithmic Mathematics, Second Edition.
Stephen B. Maurer, Anthony Ralston.
AK Peters, 1998, xix + 894 pp, $59. [ISBN 
1-56881-091-1] Republication of 1991 edi- 
tion with corrections and a few small changes. 
Greater emphasis on algorithmics, and more so- 
phisticated, than most discrete texts. (First Edi- 
tion, TR, June-July 1991.) KES 


Number Theory, P. Finite Fields: Theory, Ap- 
plications, and Algorithms. Eds: Ronald C. 
Mullin, Gary L. Mullen. Contemp. Math., 
V. 225. AMS, 1999, x + 243 pp, $49 (P). 
[ISBN 0-8218-0817-6] Proceedings of a 1997 
conference at the University of Waterloo. 




Group Theory, P. The Structure of Compact 
Groups: A Primer for the Student – A Handbook
for the Expert. Karl H. Hofmann, Sidney
A. Morris. Stud. in Math., V. 25. Walter de 
Gruyter, 1998, xvii + 835 pp, $148.95. [ISBN 
3-11-015268-1] A massive, self-contained re- 
source for experts that avoids representation 
theory and harmonic analysis. JD 


Group Theory, P. Algebraic Groups and their 
Representations. Eds: R.W. Carter, J. Saxl. 
NATO ASI Ser. C, V. 517. Kluwer Academic, 
1998, xviii + 374 pp, $173. [ISBN 0-7923-5251-3]
19 articles written by speakers at the
1997 NATO Advanced Study Institute “Modular
Representations and Subgroup Structure of
Algebraic Groups and Related Finite Groups” 
held at the Isaac Newton Institute, Cambridge. 


Ring Theory, P. Semidistributive Modules and 
Rings. Askar A. Tuganbaev. Math. & Its Ap- 
plic., V. 449. Kluwer Academic, 1998, x + 
352 pp, $157. [ISBN 0-7923-5209-2] Ex- 
plores the relationship between semidistributive 
modules and flat, projective, injective, multipli- 
cation, and Bezout modules. JD 


Ring Theory, T(18), P, L. Lectures on Mod-
ules and Rings. T.Y. Lam. Grad. Texts in
Math., V. 189. Springer-Verlag, 1999, xxiii 
+ 557 pp, $59.95. [ISBN 0-387-98428-3] A 
follow-up to Lam’s A First Course in Noncom- 
mutative Rings (TR, February 1992); focuses 
on ring theory in which modules play a central 
role. Topics: free, projective, injective, and flat 
modules, rings of quotients, Frobenius rings, 
Morita theory. Includes 600 exercises. JD 


Algebra, T(16—18: 1, 2), L. Abstract Algebra,
Second Edition. David S. Dummit, Richard M.
Foote. Prentice Hall, 1999, xiv + 898 pp. [ISBN
0-13-569302-0] New material on quadratic 
integer rings, tensor products of modules and 
tensor algebras, homological algebra, group co- 
homology; new chapters on commutative rings 
and algebraic geometry. (First Edition, TR, 
January 1993.) KES 


Algebra, T(18), P, L. Representations and Co- 
homology, I: Basic Representation Theory of 
Finite Groups and Associative Algebras. D.J. 
Benson. Stud. in Adv. Math., V. 30. Cam- 
bridge Univ Pr, 1995, xi + 246 pp, $29.95 (P); 
$52.95. [ISBN 0-521-63653-1; 0-521-36134- 
6] Provides modular representation theoretic
background for the study of group cohomol- 
ogy in Volume II. Volume I concentrates on 
Auslander—Reiten type representation theory, 
Burnside rings, and block theory with a coho- 
mological flavor. JD 


Algebra, P. The Monster and Lie Algebras. 
Eds: J. Ferrar, K. Harada. Ohio St. Univ. Math. 
Res. Inst. Public., V. 7. Walter de Gruyter, 
1998, x + 252 pp, $248. [ISBN 3-11-016184-2] 
Proceedings of a 1996 special research quarter 
at the Ohio State University. In two parts: 9 pa- 
pers on the Monster; 7 papers on Lie Algebras. 


Calculus, S(13), L. How to Ace Calculus: The
Streetwise Guide. Colin Adams, Joel Hass, 
Abigail Thompson. WH Freeman, 1998, x + 
242 pp, $14.95 (P). [ISBN 0-7167-3160-6] 
Highly entertaining, useful. Gives advice on 
such matters as “How to deal with your instruc- 
tor” (e.g., ask: “Where did you get those ultra 
cool shoes?”) together with a serious, very tra- 
ditional treatment of calculus topics (e.g., gives 
much more space to techniques of integration 
than to numerical techniques; computers are 
barely mentioned). Certain “light” sections 
(e.g., “Choosing your instructor”) may put some
people off. KS 


Calculus, T*(13: 3). Calculus: Single and 
Multivariable, Second Edition. Deborah 
Hughes-Hallett, et al. Wiley, 1998, xix + 
984 pp, $111.95. [ISBN 0-471-19490-5]; Cal-
culus: Single Variable, Second Edition. Wi- 
ley, 1998, xvii + 647 pp, $79.95 (P). [ISBN 
0-471-16442-9] From the Preface: “We have 
streamlined some topics and added new sections 
on theory and on skill-building; we have moved 
some material into separate sections on model- 
ing.” (First Edition, TR, February 1994.) 

Real Analysis, T(17: 3, 4), P, L. Fundamentals 
of Real Analysis. Sterling K. Berberian. Uni- 
versitext. Springer-Verlag, 1999, xi + 479 pp, 
$44.95 (P). [ISBN 0-387-98480-1] Lecture 
notes from year-long course given at the Uni-
versity of Texas (1985-86). Includes introduc-
tory chapters on foundations and basic topology, 
then detailed treatment of Lebesgue and abstract 
measure theory leading to function spaces. In- 
cludes other topics. Plenty of exercises. KS 


Complex Analysis, T(18: 1), P. Computa-
tional Conformal Mapping. Prem K. Kythe.
Birkhäuser Boston, 1998, xv + 462 pp, $69.95.
[ISBN 0-8176-3996-9] The theory and com- 
putation of conformal mappings of simply or 
multiply connected regions onto the unit disk 
and other canonical regions. Applies theory to 
mathematics, physics, and engineering. PG 


Complex Analysis, P. Complex Geometric 
Analysis in Pohang. Eds: Kang-Tae Kim, 
Steven G. Krantz. Contemp. Math., V. 222. 
AMS, 1999, vii + 256 pp, $55 (P). [ISBN 
0-8218-0957-1] Proceedings of a 1997 con- 
ference on several complex variables at Pohang 
University (South Korea).


Dynamical Systems, P. Nonlocal Bifurca- 
tions. Yu. Ilyashenko, Weigu Li. Math. Surv.
& Mono., V. 66. AMS, 1999, xiii + 286 pp, 
$69. [ISBN 0-8218-0497-9] Modern theory 
of normal forms for local families of vector 
fields and diffeomorphisms, hyperbolic theory, 
study of bifurcations on boundaries of Morse— 
Smale systems. RM 


Numerical Analysis, (17), P. Numerical Lin- 
ear Algebra for High-Performance Computers. 
Jack J. Dongarra, et al. SIAM, 1998, xviii +
342 pp, $37 (P). [ISBN 0-89871-428-1] Sur- 
veys the state of the art of solving systems of lin- 
ear equations and large-scale eigenvalue prob- 
lems on high-performance (i.e., vector and par- 
allel) computers. A major revision of Solving 
Linear Systems on Vector and Shared Memory 
Computers (TR, May 1991). AO 


Functional Analysis, P. Banach Algebras ’97. 
Eds: Ernst Albrecht, Martin Mathieu. Walter 
de Gruyter, 1998, x + 566 pp, $148.95. [ISBN 
3-11-015466-8] Proceedings of a conference 
held at the University of Tübingen. Research ar-
ticles, survey articles on problems in automatic 
continuity and problems related to notions of 
amenability, and a list of open questions. 


Analysis, P, L. A Primer of Infinitesimal Analy- 
sis. John L. Bell. Cambridge Univ Pr, 1998, xiii 
+ 122 pp, $29.95. [ISBN 0-521-62401-0] De- 
velops basic calculus (single and multivariable) 
and some physical applications in the context of 
smooth infinitesimal analysis; includes a chap- 
ter on synthetic differential geometry. Theory 
based on nilpotent infinitesimals (from category 
theory) rather than nonstandard analysis. AO 


Analysis, P. Wavelets and Their Applications:
Case Studies. Ed: Mei Kobayashi. SIAM,
1998, xvi + 142 pp, $32 (P). [ISBN 0-89871- 
416-8] 5 independent essays describe the use 
of wavelet techniques in mechanical and nuclear 
engineering, seismology, signal processing, and 
partial differential equations. 


Algebraic Geometry, P. Higher Homotopy 
Structures in Topology and Mathematical 
Physics. Ed: John McCleary. Contemp. 
Math., V. 227. AMS, 1999, xii + 321 pp, 
$69 (P). [ISBN 0-8218-0913-X] Proceedings 
of a 1996 conference at Vassar College held to 
honor the 60th birthday of Jim Stasheff. 


Geometry, P. Advances in Discrete and Com- 
putational Geometry. Eds: Bernard Chazelle, 
Jacob E. Goodman, Richard Pollack. Con- 
temp. Math., V. 223. AMS, 1999, xi + 463 pp, 
$99 (P). [ISBN 0-8218-0674-2] Proceedings 
of the 1996 AMS-IMS-SIAM Joint Summer 
Research Conference “Discrete and Compu- 
tational Geometry: Ten Years Later” held at 
Mount Holyoke College. 


Algebraic Topology, T(18), P. Model Cate- 
gories. Mark Hovey. Math. Surv. & Mono., 
V. 63. AMS, 1999, xii + 209 pp, $54. [ISBN 0- 
8218-1359-5] Much needed comprehensive 
resource on the relationship between a model 
category and its homotopy category. Accessi- 
ble to graduate students with some background 
in homological algebra. JD 


Optimization, T(16-17: 1), P. Maxima and
Minima with Applications: Practical Optimiza- 
tion and Duality. Wilfred Kaplan. Ser. in Disc. 
Math. & Optim. Wiley, 1999, x + 284 pp, 
$74.95. [ISBN 0-471-25289-1] 4 chapters: 
basic concepts and geometric aspects; problems 
with side conditions; optimization and math- 
ematical programming; Fenchel-Rockafellar
duality theory. Prerequisites: (advanced) cal- 
culus and linear algebra. AO 

Optimization, T(16—17: 1), P, L. Integer Pro- 
gramming. Laurence A. Wolsey. Ser. in Disc. 
Math. & Optimiz. Wiley, 1998, xviii + 264 pp, 
$59.95. [ISBN 0-471-28366-5] Incorporates 
recent developments (e.g., cutting plane theory, 
heuristic methods). Assumes some knowledge 
of linear programming and graph theory, but 
otherwise self-contained. AO 


Optimal Control, P. Geometric Control and 
Non-Holonomic Mechanics. V. Jurdjevic, R.W. 
Sharpe. Conf. Proc., V. 25. AMS, 1998, xi + 
239 pp, $49 (P). [ISBN 0-8218-0795-1] Pro- 
ceedings of a 1996 conference in Mexico City. 


Optimal Control, P. Mathematical Control 
Theory. Eds: J. Baillieul, J.C. Willems. 
Springer-Verlag, 1999, xxxii + 360 pp, $59.95.
[ISBN 0-387-98317-1] 9 papers on the devel-
opment of control theory over the last 30 years. 
Focus is on areas influenced by R.W. Brockett. 


Probability, S, P, L. The Design Inference: 
Eliminating Chance Through Small Probabili-
ties. William A. Dembski. Stud. in Prob., In-
duction, & Decision Theory. Cambridge Univ 
Pr, 1998, xvii + 243 pp, $54.95. [ISBN 0-521- 
62387-1] Not a text but a philosophical tract 
about when one can infer design behind events 
of very small probability. Thought provoking, 
fun to read, full of interesting examples. SN


Stochastic Processes, P. Ergodicity and Sta- 
bility of Stochastic Processes. A.A. Borovkov. 
Transl: V. Yurinsky. Ser. in Prob. & Stat. Wi- 
ley, 1998, xxiii + 585 pp, $175. [ISBN 0-471- 
97913-9] In-depth treatment of the asymp- 
totic behavior and resulting invariant distribu- 
tions of Markov chains and some of their gen- 
eralizations. Includes applications to queueing 
and communication networks. SN 


Elementary Statistics, S(14: 1), L. Basic Busi- 
ness Statistics: A Casebook. Dean P. Fos- 
ter, Robert A. Stine, Richard P. Waterman. 
Springer-Verlag, 1998, xvi + 244 pp, $34.95 (P).
[ISBN 0-387-98354-6] 11 classes of related 
case studies that each develop a single, key 
statistical idea and show how to use statistics 
to answer business questions. Book meant 
to replace lectures. Topics: summary statis- 
tics, sources of variation, standard error, con- 
fidence intervals, sampling, hypothesis testing, 
design of experiments, introduction to regres- 
sion. Uses JMP but also provides Minitab com- 
mands. Includes three assignments. Data avail- 
able via the web. KB 


Elementary Statistics, S(14: 1), L. Business
Analysis Using Regression: A Casebook. Dean 
P. Foster, Robert A. Stine, Richard P. Water- 
man. Springer-Verlag, 1998, xvii + 348 pp, 
$39.95 (P). [ISBN 0-387-98356-2] Compan- 
ion volume to Basic Business Statistics; same 
general structure but with 12 classes of cases. 
Topics: fitting equations to data, regression as-
sumptions, prediction and confidence intervals,
multiple regression, modeling categorical fac- 
tors, one- and two-way ANOVA, modeling cat- 
egorical response, time series. (1997 edition, 
TR, March 1998.) KB 


Elementary Statistics, T(13: 2), C. Statisti- 
cal Methods for Engineers. G. Geoffrey Vin- 
ing. Duxbury Pr (Wadsworth), 1998, xv + 
479 pp, $72.95, with disk. [ISBN 0-534- 
23706-1] Lays solid foundation for applica- 
tion of statistics within an engineering context; 
presents students with statistical tools used by 
practicing engineers. Implements ABET’s cur-
riculum recommendations for teaching engi-
neering statistics. Uses real engineering cases 
and data. Extensive graphical analysis through- 
out. Encourages computer use. KB 


Statistical Methods, P. Computer Assisted Sur- 
vey Information Collection. Eds: Mick P. 
Couper, et al. Ser. in Prob. & Stat. Surv. 
Methodology Sec. Wiley, 1998, xvi + 653 pp, 
$89.95. [ISBN 0-471-17848-9] An authorita- 
tive and comprehensive review of the field. 29 
papers in 8 sections: Introduction and Histori- 
cal Overview; Transition to CASIC; Instrument 
Design; Issues in Survey Design; Case Man- 
agement; Interviewers as Users of CASIC; Self- 
Administered Surveys; Emerging Technologies 
in CASIC. 


Statistical Methods, P. A Practical Guide to 
Heavy Tails: Statistical Techniques and Appli- 
cations. Eds: Robert J. Adler, Raisa E. Feld- 
man, Murad S. Taqqu. Birkhäuser Boston,
1998, xvi + 533 pp, $59.95. [ISBN 0-8176- 
3951-9] 24 expository papers on applications, 
data analytic techniques, and models for heavy- 
tailed distributions and processes. Aimed at 
general practitioners. 7 sections: Applications; 
Time Series; Heavy-Tail Estimation; Regres- 
sion; Signal Processing; Model Structures; Nu- 
merical Procedures. 


Statistical Methods, T(15-17: 2), C, P, L. 
Methods for Business Analysis and Forecasting: 
Text and Cases. Peter Tryfos. Wiley, 1998, xiv 
+ 576 pp, $84.95, with disk. [ISBN 0-471- 
12384-6] Covers the principal methods for 
analysis and forecasting including linear mod- 
els, regression, and econometrics. Deals with 
models for relationships involving quantitative 
or qualitative dependent and explanatory vari- 
ables. Examples, problems, and cases use real 
data. Emphasizes model formulation and inter- 
pretation rather than computation. Includes 16 
extensive cases. KB 


Applications (Communication Theory), 
T(15-17: 1), P, L. Fourier Analysis and Ap- 
plications: Filtering, Numerical Computation,
Wavelets. C. Gasquet, P. Witomski. Transl: R.
Ryan. Texts in Appl. Math., V. 30. Springer-
Verlag, 1999, xviii + 442 pp, $49.95. [ISBN 0- 
387-98485-2] Begins with material on signal 
processing and filters. Discusses convergence 
of Fourier series. Introduces Lebesgue theory, 
Hilbert space, and distribution theory; applies 
these theories to filters and sampling. Modular 
presentation. PG 


Applications (Communication Theory), 
T(16: 1, 2), S, P, L. Wavelet Analysis: The 
Scalable Structure of Information. Howard L.
Resnikoff, Raymond O. Wells, Jr. Springer-
Verlag, 1998, xvi + 435 pp, $59.95. [ISBN 
0-387-98383-X] Introductory monograph on 
wavelets and their applications to signal- 
processing. In four main parts: basics on in- 
formation theory; wavelet theory; wavelet ap- 
proximation and methods; applications. Clear 
and readable, with many illustrations, figures, 
and examples. No exercises. PZ 

Applications (Economics), T**(15-16), S**,
L. A Guide to Econometrics, Fourth Edition.
Peter Kennedy. MIT Pr, 1998, xiii + 468 pp, 
$18.95 (P). [ISBN 0-262-61140-6] Econo- 
metric problems arise from situations where 
usual basic assumptions underlying linear re- 
gression model are violated. Econometrics 
texts are “catalogs” of which estimators are de- 
sirable in what situations. Shows how to “read 
the catalog” and deal with the problems. (Third 
Edition, TR, March 1993.) RM 

Applications (Engineering), T*(14-15: 2).
Analytical and Computational Methods of Ad- 
vanced Engineering Mathematics. Grant B. 
Gustafson, Calvin H. Wilcox. Texts in Appl. 
Math., V. 28. Springer-Verlag, 1998, xxii + 
729 pp, $59.95. [ISBN 0-387-98265-5] Cov- 
ers standard body of material (ODEs, vector cal- 
culus, linear algebra, PDEs) using modern per- 
spectives. Emphasizes numerical techniques 
from the beginning, acknowledges the avail- 
ability of computer algebra systems. Uses real- 
world problems extensively. AO 


Applications, T(14: 1). All You Wanted To 
Know About Mathematics But Were Afraid To 
Ask: Mathematics for Science Students, Volume 
2. Louis Lyons. Cambridge Univ Pr, 1998, xv 
+ 382 pp, $27.95 (P); $69.95. [ISBN 0-521- 
43601-X; 0-521-43466-1] Entertaining and 
accessible exposition. Topics include multivari- 
able and vector calculus, PDEs, Fourier series, 
normal modes, waves, and linear algebra. AO 


Applications, P. Lecture Notes in Control and 
Information Sciences—237: The Confluence of 
Vision and Control. Eds: David J. Kriegman, 
Gregory D. Hager, A. Stephen Morse. Springer- 
Verlag, 1998, xii + 281 pp, $85 (P). [ISBN 
1-85233-025-2] Proceedings of a 1997 work- 
shop held on Block Island, Rhode Island. Pa- 
pers discuss theoretical results, empirical inves- 
tigations, and applications. 


Reviewers 


KB: Karla Ballman, Macalester; JD: Jill Dietz, St. Olaf; 
PG: Philip Gloor, St. Olaf; RM: Richard Molnar, 
Macalester; SN: Sam Northshield, Carleton; AO: Arnold
Ostebee, St. Olaf; KS: Karen Saxe, Macalester; KES: Kay 
E. Smith, St. Olaf; PZ: Paul Zorn, St. Olaf. 




NEW! Site Licenses and Student Pricing.
See www.derive.com

DERIVE for Windows

DERIVE is the trusted mathematical assistant relied
upon by students, educators, engineers, and scientists
around the world. It does for algebra, equations,
trigonometry, vectors, matrices, and calculus what the
scientific calculator does for numbers: it eliminates the
drudgery of performing long and tedious mathematical
calculations. You can easily solve both symbolic and
numeric problems and see the results plotted as 2D or
3D graphs.

For everyday mathematical work DERIVE is a tireless,
powerful, and knowledgeable assistant. For teaching or
learning mathematics, DERIVE gives you the freedom to
explore different mathematical approaches better and
more quickly than by using traditional methods.

System Requirements:
Windows 95, 3.1x or NT running on a computer with
8 megabytes of memory.

Suggested Retail Price: $250.
Educational pricing available.

For product information and list of dealers, fax, email,
write, or call Soft Warehouse, Inc. or visit our website at
http://www.derive.com.

The Easiest just got Easier.

Soft Warehouse, Inc. * 3660 Waialae Avenue
Suite 304 * Honolulu, Hawaii, USA 96816-3259
Telephone: (808) 734-5801 after 10:00 a.m. PST
Fax: (808) 735-1105 * Email: swn@aloha.com.

© 1996 Soft Warehouse, Inc. DERIVE is a registered trademark of Soft Warehouse,
Inc. Other trademarks are the property of their respective owners.


Princeton Math


Gnomon 


From Pharaohs to Fractals 
Midhat J. Gazalé


The beaver’s tooth and the tiger’s claw. 
Sunflowers and seashells. Fractals, Fibonacci 
sequences, and logarithmic spirals. These diverse 
forms of nature and mathematics are united by a
common factor: all involve self-repeating shapes,
or gnomons.


Gnomon is an engaging and beautifully pro- 
duced book that will appeal to anyone interest- 
ed in the wonders of geometry and 
mathematics. 


“A splendid introduction to the surprising 
properties of gnomons. . . . You put down 
[this] book with a heightened sense of awe and 
wonder. . . .”—Martin Gardner 


24 color illustrations. 124 black and white illustrations. 


Cloth $29.95 ISBN 0-691-00514-1 Due May 


The Topology of Fibre Bundles
Norman Steenrod


Fibre bundles, now an integral part of differ-
ential geometry, are also of great importance in
modern physics—such as in gauge theory. This 
book, a succinct introduction to the subject by 
renowned mathematician Norman Steenrod, 
was the first to present the subject systemati- 
cally. 

Princeton Landmarks in Mathematics 

Paper $19.95 ISBN 0-691-00548-6 Due May 


CONGRATULATIONS TO
Curtis McMullen, winner of the Fields Medal 
and Elias Stein, winner of the Wolf Prize 


Real Submanifolds in
Complex Space and
Their Mappings
M. Salah Baouendi, Peter Ebenfelt, and 
Linda Preiss Rothschild 


This book presents many of the main devel- 
opments of the past two decades in the study of 
real submanifolds in complex space, providing 
crucial background material for researchers and 
advanced graduate students. The authors 
include extensive preliminary material to make 
the book accessible to nonspecialists. 


Princeton Mathematical Series, 47: 
John N. Mather and Elias M. Stein, Editors 


Cloth $69.50 ISBN 0-691-00498-6 


Mathematical Methods of 
Statistics 


Harald Cramér


This is a classic of statistical mathematical 
theory. The first part is an introduction to the 
fundamental concept of a distribution and of 
integration with respect to a distribution. The 
second part contains the general theory of ran- 
dom variables and probability distributions while 
the third is devoted to the theory of sampling, 
statistical estimation, and tests of significance. 
Princeton Landmarks in Mathematics 
Paper $24.95 ISBN 0-691-00547-8 Due May 


Princeton University Press 


AT FINE BOOKSTORES OR CALL 800-777-4726 • HTTP://PUP.PRINCETON.EDU


THE MATHEMATICAL ASSOCIATION OF AMERICA 




Series: Spectrum 


A perfect gift for the new teacher. . . or for anyone inter- 
ested in the teaching of mathematics. 


This book is the legacy of twenty years of mathematics 
teaching. During this time, the author searched for moti- 
vation techniques, mnemonics, insightful proofs, and 
serious applications of humor to aid his teaching. The 
result is this book: part philosophy, part humor, and part 
biography. Readers will be amused and enlightened on
every page.


Mr. Stueben shows how he has used humor and word- 
play to motivate his students. The book is filled with 
wonderful problems and proofs, as well as the author's 
insights about how to approach teaching problem solving 
to high school students. Sections of the book also treat 
the use of calculators and computers in the classroom. A 
section on mnemonics shows how teachers can use mem- 
ory aids to help their students learn and retain material. 


Phone in Your Order Now! 1-800-331-1622
Monday — Friday 8:30 am — 5:00 pm 




Twenty Years Before 
the Blackboard. 


The Lessons and Humor of a Mathematics Teacher 


Michael Stueben with Diane Sandford 


All in all, Twenty Years Before the Blackboard provides a 
goldmine of ideas for the classroom teacher. Although Mr. 
Stueben taught at the high school level, his book is an excel- 
lent “methods” book for mathematics teachers at all levels. 


Read what Martin Gardner has to say about this fascinat- 
ing book: 


It’s been decades since I read so entertaining a book about 
mathematics. The book is a treasure-trove of mathematical 
jokes, rhymes, anecdotes, word play, mnemonics, and beautiful 
proofs. For teachers there is an abundance of wise advice 
based on the author’s twenty years in high school teaching. 
Mathematicians at all levels, from amateurs, to college profes- 
sors will not only chuckle over its gems, but learn much they 
did not know before. —Martin Gardner


Catalog Code: TYB/JR98 
174 pp., Paperbound, 1998, ISBN 0-88385-525-9 
List: $29.50 MAA Member: $23.50 


FAX (301) 206-9789 


or mail to: The Mathematical Association of America, PO Box 91112, Washington, DC 20090-1112 


Shipping and Handling: Postage and handling are charged as follows: USA orders (shipped via UPS): $2.95 for the first book, and $1.00 for each additional book. Canadian 
orders: $4.50 for the first book and $1.50 for each additional book. Canadian orders will be shipped within 10 days of receipt of order via the fastest available route. We do not 
ship via UPS into Canada unless the customer specially requests this service. Canadian customers who request UPS shipment will be billed an additional 7% of their total order. 
Overseas orders: $3.50 per item ordered for books sent surface mail. Airmail service is available at a rate of $7.00 per book. Foreign orders must be paid in US dollars through a 
US bank or through a New York clearinghouse. Credit Card orders are accepted for all customers. 


Address 
City State Zip 


Phone 


QTY. CATALOG CODE PRICE AMOUNT
TYB/JR98
Shipping & handling
TOTAL

All orders must be prepaid with the exception of books purchased for resale by
bookstores and wholesalers.

Payment: [ ] Check [ ] VISA [ ] MasterCard

Credit Card No. ________ Expires __/__


Signature 


THE MATHEMATICAL ASSOCIATION OF AMERICA


Student Assessment
in Calculus


A Report of the NSF Working Group on
Assessment in Calculus


ALAN SCHOENFELD, EDITOR
Series: MAA Notes


Phone in Your Order Now! 1-800-331-1622
Monday – Friday 8:30 am – 5:00 pm


If you teach calculus, you should read this book. If you
want to know what mathematics your students understand,
or if you want to know how to find out what they under-
stand, this book contains essential information for you.


It doesn’t matter whether you teach a reform or traditional
course, whether you have large or small sections, or
whether you use lectures or laboratories. The bottom line
is the same: When all is said and done, what counts is
what our students understand. And that’s what Student
Assessment in Calculus is about.


Over the last ten years calculus instruction has changed
in numerous ways. Whether they were trying on new
ideas or following the more traditional routes towards
conceptual understanding, both individual faculty and
departments needed to know if their instruction was
effective. To help deal with that issue, the National Science
Foundation brought together a Working Group of experts
in students’ mathematical thinking, in assessment, and in
calculus reform. The goals of their work were to:

• develop a framework to tailor calculus instruction to the
students’ needs;

• establish an agenda for further research on student
understanding;

• describe how to make use of a range of techniques to
test what students know, such as multiple-choice tests or
short essay questions, student portfolios and “clinical”
interviews;

• summarize major goals of the reform movement and
describe the challenges faced by those who are taking a
closer look at how students learn;

• illustrate the ways in which calculus projects attempt
(via exams, papers, projects, etc.) to find out what their
students have learned.


This book is the result of those efforts. If you teach calcu-
lus, if you want to see examples of useful assessment tech-
niques, or if you are interested in issues of how to mea-
sure student learning in mathematics, then there is a lot for
you here.


Catalog Code: NTE-43/JR97 

122 pp., Paperbound, 1997 

ISBN 0-88385-152-0 

List: $34.95 MAA Member: $29.00 






THE MATHEMATICAL ASSOCIATION OF AMERICA


Logic as Algebra 




Paul Halmos and Steven Givant


Series: Dolciani Mathematical Expositions 


This book is based on the notes of a course in logic given by 
Paul Halmos. This book retains the spirit and purpose of 
those notes, which was to show that logic can (and perhaps 
should) be viewed from an algebraic perspective. When so 
viewed, many of its principal notions are seen to be old 
friends, familiar algebraic notions that were “disguised” in logi- 
cal clothing. Moreover, the connection between the principal 
theorems of the subject and well-known theorems in algebra 
becomes clearer. Even the proofs often gain in simplicity. 


Propositional logic and monadic predicate calculus—predicate 
logic with a single quantifier—are the principal topics treated.
The connections between logic and algebra are carefully 
explained. The key notions and the fundamental theorems are 
elucidated from both a logical and algebraic perspective. The 
final section gives a unique and illuminating algebraic treatment 
of the theory of syllogisms—perhaps the oldest branch of logic, 
and a subject that is neglected in most modern logic texts. 


The presentation is aimed at a broad audience—mathematics
amateurs, students, teachers, philosophers, linguists, computer 
scientists, engineers, and professional mathematicians. 


Whether the reader’s goal is a quick glimpse of modern logic or
a more serious study of the subject, the book’s fresh approach
will bring novel and illuminating insights to beginners and pro- 
fessionals alike. All that is required of the reader is an acquain- 
tance with some of the basic notions encountered in a first 
course in modern algebra. In particular, no prior knowledge of 
logic is assumed. The book could serve equally well as a fire- 
side companion and as a course text. 


Contents: What is Logic?: To count or to think; A small 
alphabet; A small grammar; A small logic; What is truth?; 
Motivation of the small language; All mathematics. 
Propositional Calculus: Propositional symbols; 
Propositional abbreviations; Polish notation; Language as 
an algebra; Concatenation; Theorem schemata; Formal
proofs; Entailment; Logical equivalence; Conjunction; 
Algebraic identities. Boolean Algebra: Equivalence class- 
es; Interpretations; Consistency and Boolean algebra; 
Duality and commutativity; Properties of Boolean alge- 
bras; Subtraction; Examples of Boolean algebras. 
Boolean Universal Algebra: Subalgebras;
Homomorphisms; Examples of homomorphisms; Free 
algebras; Kernels and ideals; Maximal ideals; 
Homomorphism theorem; Consequences; The represen- 
tation theorem. Logic via Algebra: Pre-Boolean algebras; 
Substitution rule; Boolean logics; Algebra of the proposi- 
tional calculus; Algebra of proof and consequence. 
Lattices and Infinite Operations: Lattices; Non-distribu- 
tive lattices; Infinite operations. Monadic Predicate 
Calculus: Propositional functions; Finite functions; 
Functional monadic algebras; Functional quantifiers; 
Properties of quantifiers; Monadic algebras; Free monadic 
algebras; Modal logics; Monadic logics; Syllogisms. 


Catalog Code: DOL-21/JR98 
152 pp., Paperbound, 1998, ISBN 0-88385-327-2 
List: $27.00 MAA Member: $21.95 


Phone in Your Order Now! 1-800-331-1622


Monday — Friday 8:30 am — 5:00 pm 






Packaged with a PC compatible disk that enhances the 
material in the text. 


Suitable for classroom adoption in an innovative course for 
• a general education mathematics elective
• a mathematics or science major advanced elective
• an interdisciplinary course, even at a relatively
elementary level
• a mathematical modeling course in a
civil/environmental engineering program


This book has a dual objective: first, to introduce the reader 
to some of the most important and widespread environmen- 
tal issues of the day; and second, to illustrate the vital role 
played by mathematical models in investigating these issues. 
The environmental issues addressed include: ground-water 
contamination, air pollution, and hazardous material emer- 
gencies. These issues are presented in their full real-world 
context, not as scientific or mathematical abstractions; and 
for background, readers are invited to investigate their status 
in their own communities. 

The first part of the book leads the reader through relatively
elementary modeling of these phenomena, including
simple algebraic equations for ground water, slightly more 
complex algebraic equations (preferably implemented on a 
spreadsheet or other computerized framework) for air pollu- 
tion, and a fully computerized modeling package for haz- 
ardous materials incident analysis. The interplay between 
physical intuition and mathematical analysis is emphasized. 

For more advanced readers, the second part of the book 
returns to the same three subjects but with a higher level of 
mathematical sophistication (adjustable to the preparation of 
the reader by selection of subsections). Many important classical
mathematical themes are developed through this context,
examples coming from single and multivariable calculus,
differential equations, numerical analysis, linear algebra
and probability. The material is presented in such a way as to 
minimize the required background and to encourage the 
subsequent study of some of these fields. 

An elementary course for a general audience could be
based entirely on Part I, and a higher level mathematics, science,
or engineering course could move quickly to Part II.


THE MATHEMATICAL ASSOCIATION OF AMERICA


Mathematical Modeling
in the Environment


Charles Hadlock


Series: Classroom Resource Materials

A PC compatible diskette packaged with the text contains
a spreadsheet program that facilitates numerical
experimentation with the Gaussian plume equation introduced
in Chapter 3, as well as a public domain DOS program
(ARCHIE) for evaluating the consequences of various hazardous
materials scenarios (e.g., the physical extent of flammable
and toxic vapor clouds). The text is not tied to the use of
this software, but it is included as an aid to meeting the
pedagogical objectives of the text.


Catalog Code: ENV/SA 
312 pp., Paperbound, 1998, ISBN 0-88385-709-X 
List: $55.00 MAA Member: $43.95


Instructor’s and Solutions 
Manual for Mathematical 
Modeling in the Environment 


Charles Hadlock 


Contains the complete solutions and further discussion of 
nearly every exercise presented in the textbook. This includes 
both the mathematical/computational exercises as well as the 
research questions and investigations. Readers will benefit 
greatly from perusing solutions to the problems whether they 
have worked them out themselves or not. Students using this 
volume will still need to work out solutions of research ques- 
tions using their own sources and adapting them to their own 
geographic locations, or using their own computational 
schemes, so this volume could well be useful for students in 
many course contexts. Enrichment material is included on 
the topics of some of the exercises. Advice for teachers who 
lack previous environmental experience, but who want to
teach this material is also provided and makes it practical for 
such persons to offer a course based on these volumes. 


Catalog Code: EVS/SA 
150 pp., Paperbound, 1998, ISBN 0-88385-713-8 
List: $18.95 MAA Member: $14.95 


Phone in Your Order Now! 1-800-331-1622


Monday — Friday 8:30 am — 5:00 pm 
or mail to: The Mathematical Association of America, PO Box 91112, Washington, DC 20090-1112


FAX (301) 206-9789 


Vita Mathematica 


Historical Research and 
Integration with Teaching 


Ronald Calinger, Editor 


The use of the history of mathematics in the 
teaching of mathematics at all levels is an idea 
whose time has come. To use history in the 
teaching of undergraduate mathematics, the 
instructor must be familiar with the history as 
well as the mathematics. Vita Mathematica will 
enable college teachers to learn the relevant histo- 
ry of various topics in the undergraduate curricu- 
lum and help them incorporate this history in 
their teaching. 

For example, should calculus be approached 
from a geometric or an algebraic point of view? 
The book shows us how two important eigh- 
teenth century mathematicians, Colin Maclaurin 
and Joseph-Louis Lagrange, understood the calcu- 
lus from these different standpoints and how their 
legacy is still important in teaching calculus 
today. We also learn why Lagrange’s algebraic 
approach dominated teaching in Germany in the 
nineteenth century. Some of the reasons for this 
are related to the appropriate foundations of the 
calculus, and so the book traces the ancient histo- 
ry of one of the possible foundations, the concept 
of indivisibles. Even though we generally do not
use this concept formally today, many ideas for a 
heuristic approach to the calculus can be developed
out of its study.

Vita Mathematica contains numerous other 
articles dealing with calculus, with algebra, combinatorics,
graph theory, and geometry, as well as
more general articles on teaching courses for 
prospective teachers. 

This volume, then, demonstrates that the his- 
tory of mathematics is no longer tangential to the 
mathematics curriculum, but in fact deserves a 
central role. 


Catalog Code: NTE40 
350 pp., Paperbound, 1996, ISBN 0-88385-097-4 
List: $34.95 MAA Member: $29.00 


ORDER FROM: 
THE MATHEMATICAL ASSOCIATION OF AMERICA 
P.O. Box 91112, Washington, DC 20090-1112 


1-800-331-1622 


Name


Address 
City 


State Zip 


(301) 617-7800 FAX (301) 206-9789 


QTY CATALOG CODE PRICE AMOUNT 
NTE40 
TOTAL 
Payment [ ] Check [ ] VISA [ ] MasterCard
Credit Card No. Expires /
Signature 


The Lighter Side 
of Mathematics 


Proceedings of the Eugène Strens Memorial Conference
on Recreational Mathematics and its History 


Richard K. Guy and 
Robert E. Woodrow, Editors 


The level of exposition is high, and the fun infectious. 
The reader can find routes to serious mathematics, 
such as hyperbolic geometry, fractals, group theory, 
and number theory, all beginning with a delightful 
puzzle. A sparkling addition for any library where the 
lover of mathematics at any level comes for support.

—Choice




The book is a fantastic feast of far-from-trivial topics.
Entertaining mathematics not only can lead to unexpected
applications...but it is one of the best ways to stimulate
interest in mathematics among both students and
the general public.

—Martin Gardner, American Scientist 




In August of 1986 a special conference on recreational 
mathematics was held at the University of Calgary to 
celebrate the founding of the Strens Collection. Leading 
practitioners of recreational mathematics from around 
the world gathered in Calgary to share with each other 
the joy and spirit of play that is to be found in recreation- 
al mathematics. 


The papers in this volume represent a treasure trove of
recreational mathematics by a star-studded cast: Leon
Bankoff, Elwyn Berlekamp, H.S.M. Coxeter, Ken Falconer,
Branko Grünbaum, Richard Guy, Doris Schattschneider,
David Singmaster, Athelstan Spilhaus, Stan Wagon and
many others.

If you are interested in tessellations, Escher, tiling,
Rubik’s cube, pentominoes, games, puzzles, the arbelos,
Henry Dudeney, or change ringing, then this book is a
must for you.


Catalog Code: LSMA/JR
376 pp., Paperbound, 1994, ISBN 0-88385-516-X
List: $42.95 MAA Member: $33.50


ORDER FROM: 
THE MATHEMATICAL ASSOCIATION OF AMERICA 
1529 Eighteenth Street, NW Washington, DC 20036 
1-800-331-1622 (301)617-7800 FAX (301) 206-9789 


Membership Code:
Name
Address
City State Zip


QTY. CATALOG CODE PRICE AMOUNT
LSMA/JR
TOTAL
Payment [ ] Check [ ] VISA [ ] MasterCard
Credit Card No. Expires /
Signature


THE MATHEMATICAL ASSOCIATION OF AMERICA 


Penrose Tiles to 
Trapdoor Ciphers 


... and the Return of Dr. Matrix 




MARTIN GARDNER 
A reissue of another Gardner classic 
Series: Spectrum 


The MAA is proud to reissue Martin
Gardner’s Penrose Tiles to Trapdoor Ciphers,
printed with a new bibliography, corrections
to the text, and a postscript from the
author. Penrose Tiles assembles a collection
of Gardner’s “Mathematical Games” columns
from Scientific American that include many
of the problems, puzzles and paradoxes
that have earned him a reputation as a
master mathematical magician.

Included here are chapters on Conway’s
surreal numbers, Mandelbrot’s fractals, and
Smullyan’s logic puzzles, as well as puzzlers
dealing with hyperbolas, negative numbers,
pool-ball triangles, and Penrose tiles and
trapdoor ciphers. And of course, you can
read of the return of Dr. Irvine Joshua
Matrix (famed numerologist and CIA
operative), one of Martin Gardner’s oldest
fictional friends.


Read what reviewers have said about Penrose Tiles to Trapdoor
Ciphers ...


The scope is extraordinary ... Those fortunate enough to have
encountered Gardner's columns in their original appearance
can look for personal bonuses of reminiscence as they read this
book ... Gardner is one of history’s great figures of recreational
mathematics.
—New Scientist


Penrose Tiles to Trapdoor Ciphers is invaluable to those interested
in recreational mathematics and should enlighten those who
consider such activity to be difficult or boring.
—The Mathematics Teacher


No popular mathematical writer has ever matched Gardner's
breadth and richness of knowledge and clarity of style, and this
book is up to his usual unsurpassable standard.
—American Scientist


Catalog Code: TILES/JR97 
312 pp., Paperbound, 1997, ISBN 0-88385-521-6 
List: $27.95 MAA Member: $21.95 


Phone in Your Order Now! 1-800-331-1622


Monday — Friday 8:30 am — 5:00 pm FAX (301) 206-9789 
or mail to: The Mathematical Association of America, PO Box 91112, Washington, DC 20090-1112 


Shipping and Handling: Postage and handling are charged as follows: USA orders (shipped via UPS): $2.95 for the first book, and $1.00 for each additional book. 
Canadian orders: $4.50 for the first book and $1.50 for each additional book. Canadian orders will be shipped within 10 days of receipt of order via the fastest avail- 
able route. We do not ship via UPS into Canada unless the customer specially requests this service. Canadian customers who request UPS shipment will be billed an 
additional 7% of their total order. Overseas orders: $3.50 per item ordered for books sent surface mail. Airmail service is available at a rate of $7.00 per book. 
Foreign orders must be paid in US dollars through a US bank or through a New York clearinghouse. Credit Card orders are accepted for all customers. 


Name
Address
City State Zip
Phone

QTY CATALOG CODE PRICE AMOUNT
TILES/JR97
Shipping & handling
TOTAL

All orders must be prepaid with the exception of books purchased for
resale by bookstores and wholesalers.

Payment [ ] Check [ ] VISA [ ] MasterCard
Credit Card No. Expires /
Signature


AMERICAN MATHEMATICAL SOCIETY 


The AMS is pleased to invite authors to submit manuscripts to be consid- 
ered for publication in the Student Mathematical Library, a new series of 
undergraduate studies in mathematics. 


This developing series is intended to spark undergraduates’ appreciation for 
research by introducing them to interesting topics of modern mathematics. 
By emphasizing original topics and approaches, the series aims to broaden 
students’ mathematical experiences. Books to be published in the series 
should be suitable for honors courses, upper-division seminars, reading 
courses, or self-study. 


Editorial Board: 

David M. Bressoud, Macalester College 
Robert L. Devaney (Chair), Boston University 
Carl Pomerance, University of Georgia 
Hung-Hsi Wu, University of California, Berkeley 


Volumes in the Student Mathematical Library series that would be suitable as 
continuations from standard undergraduate courses might cover topics such 
as: coding theory following on from number theory and/or algebra, Fourier 
series from analysis or ODEs, elementary PDEs from analysis and ODEs. 
Volumes that are related to topics normally seen in graduate school might 
cover: introductory differential geometry, minimal surfaces, introductory 
algebraic geometry, topics in representation theory, complex analysis, or 
probability. Other volumes might cover topics that are not standard 
elements of the curriculum, such as mathematical physics, game theory, or 
mathematics of finance.


These works should contain problems, either within the body of the text or 
at the end of each chapter or section. Connections to current research are 
encouraged; this may take the form of reports on recent results and, when 
appropriate, lists of open problems of continuing interest. 


SPRINGER FOR MATHEMATICS 


STAN WAGON, Macalester College, St. Paul, MN 


MATHEMATICA® IN ACTION 


Second Edition 


This second edition of 
Mathematica in Action is 
designed both as a guide to 
the extraordinary capabilities 
of Mathematica as well as a 
detailed tour of modern math- 
ematics by one of its leading 
expositors, Stan Wagon. Ideal
for teachers, researchers, and
Mathematica enthusiasts, this book includes an
eight page full color insert and fifty percent new
material all organized around elementary topics,
intermediate applications, and advanced projects.
In addition, the book uses Mathematica 3.0
throughout, notebooks for which are available on
the TELOS web site (www.telospub.com).


1999/608 PP., 522 ILLUS./HARDCOVER
$69.95/ISBN 0-387-98252-3


BENNO ARTMANN, Technische Hochschule Darmstadt, 
Germany 


EUCLID — THE CREATION OF 
MATHEMATICS 


The way in which Euclid presents essential fea- 
tures of mathematics has set the standard for more 
than 2000 years. By displaying the axiomatic foun- 
dation of a mathematical theory and its conscious 
development towards the solution of a specific 
problem, Euclid shows how abstraction works and 
how it enforces the strictly deductive presentation 
of a theory. Euclid — The Creation of Mathematics 
is a book for all lovers of mathematics with a solid 
background in high school geometry—from teachers
and students to university professors. It is an
attempt to understand the nature of mathematics
from its most important early source. 


1999/APP. 352 PP., 116 ILLUS./HARDCOVER/$49.95 
ISBN 0-387-98423-2 


FUZHEN ZHANG, Nova Southeastern University, 
Fort Lauderdale, FL 


MATRIX THEORY 


Basic Results and Techniques 


The aim of this book is to concisely present fun- 
damental ideas, results, and techniques in linear 
algebra and mainly matrix theory. Each chapter 
focuses on the results, techniques, and methods 
that are beautiful, interesting, and representative, 
followed by carefully selected problems. Matrix 
Theory can be used as a text or a supplement for a 
linear algebra and matrix theory class or seminar
for senior or graduate students. The only prereq- 
uisites are a decent background in elementary lin- 
ear algebra and calculus. 


1999/APP. 278 PP./HARDCOVER/$49.95/ISBN 0-387-98696-0 
UNIVERSITEXT 


ELIAS DEEBA and ANANDA GUNAWARDENA, both of 
University of Houston-Downtown, TX 


INTERACTIVE LINEAR ALGEBRA 
WITH MAPLE V®


A Complete Software Package for Doing 
Linear Algebra 


Interactive Linear Algebra with Maple V is a complete
software package for doing linear algebra,
consisting of the printed book and a CD-ROM
(diskettes available on request). The interactive text
includes a collection of interactive lessons and labs,
as well as a stand-alone testing system. Students
using this text will learn the concepts of linear algebra
and their applications in an interactive environment
characterized by experimentation,
exploration, and discovery learning.


1998/330 PP., CD-ROM/SOFTCOVER/$49.95 
ISBN 0-387-98240-X 
TEXTBOOKS IN MATHEMATICAL SCIENCES 


Now Also Available with 


Student Version! 


1999/330 PP. SOFTCOVER BOOK, CD-ROM, 

PLUS 274 PP. SOFTCOVER MANUAL & CD-ROM IN BOX 
$89.95 (TENT.)/ISBN 0-387-98829-7 

TEXTBOOKS IN MATHEMATICAL SCIENCES 


ROBIN WILSON and JEREMY GRAY, both of The Open 
University, Milton Keynes, England 


CLASSICS FROM THE 
MATHEMATICAL INTELLIGENCER 


1999/APP. 488 PP., 109 ILLUS./HARDCOVER/$29.95 (TENT.) 
ISBN 0-387-98686-3 



CALL: 1-800-SPRINGER or FAX: 201-348-4505
WRITE: Springer-Verlag New York, Dept. S289,
PO Box 2485, Secaucus, NJ 07096-2485
VISIT: your local technical bookstore.

E-MAIL: orders@springer-ny.com
INSTRUCTORS: Call or write for information on
textbook exam copies.


YOUR 30-DAY RETURN PRIVILEGE IS ALWAYS GUARANTEED! 


6/99 | REFERENCE: S289


Springer 


http://www.springer-ny.com


THE MATHEMATICAL ASSOCIATION OF AMERICA 


1529 Eighteenth Street, N.W. 
Washington, DC 20036 




